C Strings and my slow descent to madness (www.deusinmachina.net)
145 points by Decabytes | 2023-04-06 | 329 comments




In well-written C, you don't work with strings the way you do in other HLLs. For example, extracting and copying substrings is unnecessary unless you want to modify the parent string. Otherwise, a substring is represented by a pointer and a size_t length, and can easily be printed that way via the "%.*s" printf specifier:

    const char *s = "Hello World!";
    const char *world = s + 6;
    size_t world_len = 5;
    printf("%.*s\n", world_len, world);

In other HLLs it is easy to have subviews into other strings. C makes it needlessly hard by requiring null termination in half the APIs.


I love how in every C code snippet on every comment on this thread, somebody got something wrong. I take it as a sign that it's probably best to avoid C as much as possible.

Hey, at least it's not multithreaded code!

Any C compiler that isn't a trivial toy implementation will warn about that, so it's hardly a C gotcha.

clang and VC++ for x64 warn about this out of the box, but gcc seems to need -Wall.

Maybe my least favorite "feature" of C. I can manage most aspects of zero-terminated strings well enough, but when I have to specify the length of them, is it an 'int', 'size_t', 'ssize_t', or something else? (Answer: All of the above!)

It's unfortunate the author put the arrays-are-pointers thing so early in the doc, as that's a very beginner-to-C mixup and really nothing at all to do with strings. Otherwise, yep. It's pretty bad. C is a great language, but its string handling is definitely garbage. You get used to it pretty quick, and it's not hard to write a handful of sane wrappers or a simple string library for your own use, but the standard library's terrible string functions are an unending source of bugs.

I don't see any mention or insinuations of arrays-are-pointers anywhere in the article. Am I missing something?

This bit:

    But you might be asking. “Why can’t I just assign the source variable directly to the destination variable?”

    int main() {
      char source[] = "Hello, world!";
      char* destination = source;
    
      strcpy(destination, source); // Copy the source string to the destination string
    
      printf("Source: %s\n", source);
      printf("Destination: %s\n", destination);
    
      return 0;
    }
    You can. It’s just that destination now becomes a char* and exists as a pointer to the source character array. If that isn’t what you want then this will almost certainly cause issues.

This is almost a cliche among many C language lawyers and/or Stack Overflow answer-rich people and I know you mean well, but: arrays are not pointers.

In some contexts, the name of an array decays to a pointer to its first element. That is a better way of putting it, and it's a (much) weaker statement.

Edit: if they were the same, this code:

    int foo[] = {1, 2, 3};
    int *bar = foo;
    printf("%zu and %zu\n", sizeof foo, sizeof bar);
Would print the same value twice, but it doesn't. On Ideone [1] I got 12 and 8.

[1]: https://ideone.com/CP7WTu


This also makes a big difference once we start talking about pointers to arrays.

    int a[] = {1, 2, 3};
    int (*p1)[3] = &a; // ok
    int (*p2)[3] = &a[0]; // not ok
    int *p3 = &a; // not ok
(It should be noted that these will compile with warnings in C due to implicit conversions via void*, but you're still risking UB if you actually use the resulting value. They are all errors in C++ because it doesn't have implicit conversion from void*.)

Nothing wrong with your third line. Did you mean something else?

I forgot the &; comment updated now, and I added another example.

Got it, thanks!

> If we try to print out some Japanese characters… [] The output isn’t what we expect.

Yes it is. And I bet on a modern windows version it is too. The terminal has been (probably intentionally) neglected by ms for a long time, but as far as I know this has mostly been fixed on modern windows versions.

EDIT: Author admits it later in the text "will be fixed in Windows 11 and Windows Server 2022"

Also it says "strlen("????")); [...] and the output is… The length of the string is 12 characters". But according to "man strlen": "RETURN VALUE: The strlen() function returns the number of bytes in the string pointed to by s.". It says nothing about "number of characters".


It makes sense to point it out even if it's fixed in win11, lots of people (myself included) are still on 10.

> The terminal has been (probably intentionally) neglected by ms for a long time,

I don't think it is an intentional lack of care, just a lack of care. Internally MS devs affected by the appalling state of the console just did what the rest of us did and installed an alternative.

> but as far as I know this has mostly been fixed on modern windows versions.

Ish. The default console for powershell is better, but a lot of improvements you might be thinking are in there are in fact only in Windows Terminal (https://en.wikipedia.org/wiki/Windows_Terminal) which is not currently included by default.


> you might be thinking are in there are in fact only in Windows Terminal

A lot of those changes are in ConsoleHost, so Windows 10 and 11 get those improvements (like VT100 sequences) in cmd.exe as well


> Also it says "strlen("????")); [...] and the output is… The length of the string is 12 characters". But according to "man strlen": "RETURN VALUE: The strlen() function returns the number of bytes in the string pointed to by s.". It says nothing about "number of characters".

Yeah - when dealing with Unicode, you have to be very clear about whether you're dealing with bytes, runes or glyphs.


Runes are not a Unicode concept - that’s a Golangism. Basically a code point.

Also in terms of Unicode, graphemes are even more relevant to the programming side than glyphs - unless you’re writing a renderer.


> And I bet on a modern windows version it is too.

It's still broken unfortunately, you need to switch the console to a special UTF-8 codepage in your own code:

    SetConsoleOutputCP(CP_UTF8);
...and before exit restore it to the original code page.
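
A minimal sketch of the save-and-restore pattern (using only the documented console API; as noted below, stdio output can still misbehave if a multibyte sequence is split across writes):

    #include <windows.h>
    #include <stdio.h>

    int main(void)
    {
        UINT original = GetConsoleOutputCP(); // remember the current code page
        SetConsoleOutputCP(CP_UTF8);          // switch the console to UTF-8

        puts("h\xC3\xA9llo");                 // UTF-8 bytes, written in one go

        SetConsoleOutputCP(original);         // restore before exiting
        return 0;
    }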

... except that that is also subtly broken.

It works if you write multiple UTF-8 code-units in one go, but breaks if you send them in several writes (and by that, I mean direct writes to the HANDLE). It also breaks if you try to use the ANSI API (with the A suffix), as it internally tries to convert the bytes from codepage-random to UTF-8.

You run into both issues if you try to use the MS implementation of stdio (printf and friends).

And we didn't even discuss command line argument passing yet :-)

I had a lot of fun with this (more explanation in the issue comments): https://github.com/AgentD/squashfs-tools-ng/issues/96#issuec...

I tried to test it with the only other two languages I know besides English: German and Mandarin. Specifically also because the latter requires multi-byte characters to work. Getting Chinese text I/O to work at all in a Windows DOS box, on an existing German Windows 7 installation, was an adventure on its own and ended up breaking things in different ways than German text.

Turns out, trying to write language agnostic command line applications on Windows is a PITA.


Windows is truly the gift that keeps on giving :D

Honestly if you are expecting a sane and modern text console on Windows you're just begging for disappointment. I note that even the author briefly tries it on Bash and finds it to be undramatic.

With the woes of string.h being known, why not just use an alternative like https://github.com/antirez/sds ?

I’ve also been having a blast with C because writing C feels like being a god! But the biggest thing that I like about C is that the world is sort of written in it!

Just yesterday I needed to parse a JSON… found a bunch of libraries that do that and just picked the one whose API I liked.


>>I’ve also been having a blast with C because writing C feels like being a god

Not trying to be a troll but as someone who has also written a lot of C in the past why do you feel like this?


It's not doing as many things behind your back as the dynamic languages and C++ do. More things are your responsibility.

It’s the access and control that it gives me!

Whereas I’d pick Go when I was doing some concurrency, I can now explore a bunch of concurrency libraries, including some implementations that look a lot like Go channels. Want to watch a file for changes? I can do that all the way from talking to the kernel to picking a multi-platform library. I guess I haven’t really found anything that I can’t do in C, and if I’m lazy, I can just embed a scripting language to handle things at a higher level. Macros are also very powerful! I’ve been writing code that writes code for me, exporting the thing to a .h file and importing it using #include.

pkg-config --list-all has become my friend and I keep discovering that the world is written in C and the access to libraries is huge!

Also, idiomatic C is whatever you make it. There are a bunch of ways to skin a cat. Want a different platform? Build tool? Compiler? Debugger? Wanna write your own debugger? C is chill with that.

It’s also such a simple language that without much effort you can know everything about it (I don’t care that much about anything over C99). I don’t know the whole ecosystem, or standard libraries, or data structures and algorithms and whatnot, but the language itself is quite trivial.

With that said, I’m not using C in project teams. In that setting some strong conventions would likely be necessary or even better, something enforced by tools or the compiler (like Go), but yeah, I’ve been quite enjoying working with C and being kind of annoyed at other langs that I need to use for work because they keep doing all this stuff behind my back that is supposed to help me, but instead is a pain trying to debug and understand what is actually happening


Thank You! That was a nice explanation.

Sorry to be "that person", but have you tried Rust yet? It checks a lot of your boxes:

- Access and control, nothing "behind your back"

- Low or high level, as you prefer

- Swap in different implementations (custom allocator, different async runtime, etc)

- Really powerful macros

- Strong conventions, safe by default (but you can break them, go into the weeds if needed)

Downsides compared to your list:

- More complex than C or Go (though less than C++)

- Only one production compiler, and everyone assumes `cargo` build system (though both are very good)

- Library ecosystem not quite as extensive (though there is a lot of good stuff on crates.io, and you can always write bindings to C)

The little things that seal the deal:

- Enums (tagged unions without the danger or boilerplate)

- Zero-cost closures

- Incremental compilation

- "If it compiles, it probably works"

- Standardized documentation via `rustdoc`

- Module system


hehe! It's ok to be "that person" :)

Yes, I've tried Rust and have even shipped some projects with it! I think the thing that didn't work for me was the complexity. It felt like I had to keep a lot of things in mind to be effective (traits can be a bit obscure, i.e. magic IMHO); also lifetimes and Option made the code complex by having either a bunch of match or .unwrap all over the place.

With that said tho, Rust would be one of my top picks for a professional setting or a codebase that I share with a team, because of the really good defaults that it has! I read once somebody comparing the Rust compiler with a bunch of tiny unit tests that the developer doesn't have to write, and I agree with that!

With C tho, for personal stuff I can do things that I'd never do professionally (i.e. use the OS as my GC, because when the process dies the OS is gonna "free" my memory allocations anyway! I know, terrible, but I'm having fun! ¯\_(ツ)_/¯)


I came from HLL like C# and I've come to love C for the same reasons. The only language I can fit in my head in its entirety and no one telling me this is the "right way" to write code, don't use this or that feature, etc. When I'm writing C, I am free. I absolutely love this freedom!

Exactly!

Yes, I do think that you cannot fairly dismiss C because of strcpy_l only being available on Windows when it's quite possible to implement it yourself or use a library like the one you mention.

I encourage people to use sds if that's the best option.

However, I don't think that's the best option if you can roll your own.

I personally think that there should be two different types of strings: static and dynamic ones. The static ones should not be able to be changed, but the dynamic ones can serve as a "string builder" type of sorts.

Second, I don't see sds's first advantage (in the README) as an advantage. Sure, you may have to explicitly pass the buffer in to C functions, but that tells you that you're calling a function that takes a char array rather than your string. It makes it more explicit.

Third, if you use my method of splitting static strings from dynamic strings, then sds's second advantage doesn't apply because the pointer will never change.

But the disadvantages of sds still apply, and both disadvantages are big since they easily lead to bugs. Hence, I think sds is not the best option if you can make your own.

Oh, another advantage of the static/dynamic split: I can implement small string semantics. For small enough strings, I use a union to put the array into the same bytes as the pointer, so on 64-bit machines, I can have 8-byte strings (including nul) before needing an actual allocation.
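
A minimal sketch of that union trick (names are hypothetical; assumes 64-bit pointers, so seven chars plus the nul fit inline):

    #include <stddef.h>

    struct my_str {
        size_t len;          // byte length, excluding the nul
        unsigned char small; // 1 if the bytes live inline, 0 if heap-allocated
        union {
            char buf[8];     // inline storage: up to 7 chars + nul, no allocation
            char *ptr;       // heap pointer for longer strings
        } u;
    };

    // Accessor that hides which representation is in use.
    static const char *my_str_data(const struct my_str *s)
    {
        return s->small ? s->u.buf : s->u.ptr;
    }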


> With the woes of string.h being known, why not just use an alternative like https://github.com/antirez/sds ?

That library really doesn't address any of the Unicode issues.


Was going to say the same thing.

If you want Unicode in C a wrapper library is pretty much a given.

When I was adding Unicode support to the small scheme interpreter I like playing with I found a super simple string library, a bunch of generated code (who doesn’t love 1000 line switch statements) for dealing with utf-8 code points and bob’s your uncle. Could have probably found a library that did it all but the goal was learning and yak shaving.

Haven’t ever messed with the wide strings, seems like more of a hassle than they’re worth.


> I’ve also been having a blast with C because writing C feels like being a god!

That's funny, because I look at it in the opposite way: it makes me feel like a super-fallible human because it's so easy for me to break things in horrible ways, something a hypothetical god would not do.

Something like Rust &str or String would make me feel more like a god, as I can do whatever I want (more or less) without worrying about safety.


A hypothetical god would not destroy the world, but absolutely could.

I think that's the idea - they're not reveling in the fact that they can do anything anybody could reasonably want to do, they're reveling in the fact that they can also do everything else, too.


Yes, this is something to get used to. The BSDs created strlcpy(3) and wcslcpy(3)

https://man.openbsd.org/strlcpy.3

https://man.openbsd.org/wcslcpy.3

which to me help with some of these issues. Too bad other operating systems do not have them. On Linux there is libbsd to get them, but I would like to see them added to the C standard library.

Instead the c23 standard is messing with realloc(3) which could break some old programs. I have not looked at that in detail yet, so maybe it is a non-issue :)


I don't know of a compiler that forces you to use the newest version of the standard, which is why I've always kind of thought "don't break old code" was treated too much like dogma. So from that perspective, a non issue.

However, there is a problem that has nothing to do with old code: they increased the number of situations that constitute undefined behavior, with no public discussion and no justification. It's frankly dangerous behavior.


Ushering out strlcpy() https://lwn.net/Articles/905777/

These functions are in the current POSIX draft. Though not yet published, it's quite unlikely they will be removed (someone actually filed an issue against POSIX to try to get strlcpy removed, basically on the grounds of "it's not perfect so it should be removed", and the issue got rejected on the basis that there's no consensus for removal, and it seems unlikely this will change), and as a result the functions are getting added to glibc: https://sourceware.org/pipermail/libc-alpha/2023-April/14696...

strlcpy is nice due to the guaranteed NUL termination.

strlcpy is not so nice due to the strange (IMO) return value of the number of characters in the source string. Which could be the number of characters copied or much, much larger than the number of characters copied. snprintf does the same thing.

So using strlcpy is safe (by C's low bar) but using the return value may be highly unsafe.
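
For what it's worth, the intended use of that return value is the truncation check; a sketch (strlcpy is in string.h on the BSDs, or via libbsd on Linux):

    #include <stdio.h>
    #include <string.h>

    int main(void)
    {
        const char *src = "a string that is longer than the buffer";
        char dst[16]; // a real array, so sizeof yields the capacity
        if (strlcpy(dst, src, sizeof dst) >= sizeof dst)
            fprintf(stderr, "truncated to: %s\n", dst); // dst is still nul-terminated
        return 0;
    }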


The thing that annoys me the most about strlcpy is that it is supposed to be safer, but what happens in the case where the source string is not properly NULL terminated? You might think that it will stop at the character limit you specified, but that's not what it does. It just blows on past the end of the buffer looking for a \0 until it either finds one or causes a segmentation violation.

IMHO I would like it much more if the return values were:

  0: string copied
  1: string partially copied but truncated
  -1: Error, errno set.  This can occur when src or dst are NULL.

Linux's strscpy addresses these issues.

Which is great if you're a kernel developer, a bit of a moot point for application developers.

> what happens in the case where the source string is not properly NULL terminated

That’s not a string.


None of strlcpy, strncpy, and strcpy will know that you have not provided a string. They will assume the source pointer is a string and as such, will read (and write, in the case of strcpy) bytes until they find that NUL.

This is the upside of strlcpy. Whatever is in your output buffer is guaranteed to be a NUL terminated and have your desired length. strncpy does not make that guarantee. strcpy will give you something with a NUL terminator but it could be well past the end of the output buffer. Hello, CVE.

The more I write here, the more I realize how silly it would be to write anything dealing with human-readable text in C in 2023. I had been working on a C webserver a while back but I think I'm going to purge that from my local git server and start over with something else.


Passing something that is not a string to strcpy or strlcpy is undefined behavior. They operate on strings, which are null-terminated by definition. On the flip side, strncpy operates on character arrays, which do not have to be null-terminated. (This is also why the output buffer is not always null-terminated: it's not meant to represent a string, despite the highly confusing str- prefix.)

Look at the declarations:

  char *strcpy(char *dest, const char *src);
  char *strncpy(char *dest, const char *src, size_t n);
  size_t strlcpy(char *dest, const char *src, size_t size);
From that, how do you know that strncpy expects and produces a character array, strcpy expects and produces a string, and strlcpy takes either a character array or string and produces a string?

Your descriptions of the string/character copy functions are factual and accurate. But correct use depends on programmer understanding. You do not get any runtime guarantee. And IME, when you are under a deadline, 13 function calls and 3 message queues deep in some ancient codebase, while trying to get a non-trivial feature working, the distinction between a character array and a string is easily forgotten.

Anyways, my assertions are:

1) Using C for strings is a poor choice.

2) If your compiler vendor or cranky boss forces use of C, use strlcpy to handle string copying.


I don't disagree, although I would suggest using something other than strlcpy because it also may not do what you want.

Yup, there is also Linux's strscpy which doesn't require reading memory from the source string beyond the specified "count" bytes and the return value is idiot proof.

Well-written C tends to minimise string usage in general, preferring to convert to another format as soon as possible. Allocating, copying, and passing around strings in large quantities is not a good idea for efficiency, but of course some people coming from other HLLs seem to try to do it anyway, which causes many other problems.

THIS.

And programming, engineering, and life in general have so, SO many other situations where "X is not very good at doing Y". Yet (my experience) guys seem extremely resistant to the common-sense strategy of "then try to minimize how much Y you do with X".


to the point I often wonder if strings should exist.. buffers -> symbols | structs.

> preferring to convert to another format as soon as possible.

Like what?


A structure of binary fields.

I've been wondering lately why many people write c in c++ rather than just c. I think this might be the reason.

People write C in C++ because they don't actually know C++ and think it's "basically C with classes and strings".

There are legitimate reasons why someone would rather write C, but "I don't understand RAII" is not one of them.


C with basic templates is normally what I want. Occasionally other C++-isms drift in, but normally to the harm of the code quality.

C++ doesn’t require you to commit to all of its features and/or paradigms. Using it as you see fit is valid. Just don’t advertise yourself as a C++ programmer to the job market, as it’s not what most people expect.

There’s nothing wrong with “C with classes and strings” idea by itself, if that is your choice or a consciously sufficient level of competence.


It doesn't require you to commit to all of its features, that's certainly correct. But it does require you to commit to its principles; if you're needlessly passing naked pointers around, you're really writing C code with a C++ compiler.

I don’t see how it requires you to commit to any principles, if you can avoid those you don’t need and still successfully compile. That’s called “suggests” or “allows”, not “requires”. Yes, some people are writing C code with classes and strings in C++. That’s why we call this mode “C with classes and strings”. I believe that you are attached to these principles (see their benefits), and that is fine. But not everybody likes full-on C++.

For a long time on Windows the official MS recommendation was to use the C++ compiler for C projects.

Which was exacerbated by an obsolete C compiler that only supported C89.


the usual accusation is that many people write c++ code as if it were c. also, the code in the 2nd ed of K&R was all compiled with stroustrup's c++ compiler, as there wasn't a c compiler that could handle it.

I stopped when I read strcmp returns 0 if two strings are equal and 1 if they aren't.

A much better description of strcmp's behaviour: https://en.cppreference.com/w/c/string/byte/strcmp

It's actually 0 if equal, positive if greater, negative if less than.

  > The strcmp() and strncmp() functions return an integer greater than, equal to, or less than 0, according to whether the string s1 is greater than, equal to, or less than the string s2.  The comparison is done using unsigned characters, so that ‘\200’ is greater than ‘\0’.

cmp stands for compare, so the behaviour (returns <0, 0, or >0) is completely reasonable. With three possible outcomes, the function is suitable to be used for sorting.
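
That three-way result is exactly the contract qsort expects from its comparator; a minimal sketch sorting an array of C strings:

    #include <stdio.h>
    #include <stdlib.h>
    #include <string.h>

    // qsort passes pointers to the elements (here char *), so dereference once.
    static int cmp_str(const void *a, const void *b)
    {
        return strcmp(*(const char **)a, *(const char **)b);
    }

    int main(void)
    {
        const char *words[] = { "pear", "apple", "orange" };
        qsort(words, 3, sizeof words[0], cmp_str);
        for (int i = 0; i < 3; i++)
            printf("%s\n", words[i]); // apple, orange, pear
        return 0;
    }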

It's sad how often you find the truth at the very bottom of a HN thread these days

Okay, I agree that by default, C strings are bad.

But it doesn't have to stay that way. Someone else in the comments mentioned antirez's sds library for dynamic strings. This works, but you could also easily roll your own. All you need is an init function, and perhaps an assert or other check at the end of it that the string has a nul terminator.

At that point, type checking will let you blindly pass those strings (or their char arrays) to any of those C functions without worry.

Edit: I'll also add that I think a string library should have a difference between static strings and string builders (dynamic strings). It makes everything easier.


> by default, C strings are bad.

C strings aren't bad. They can't be, because they don't exist. C doesn't have strings. And that is the issue.

As you say, things get a lot better when you actually introduce strings as a concrete concept rather than a set of loose conventions.


There is no "loose convention". A C string is a null terminated string of non-null bytes. That's the definition. Working with them in memory-unconstrained environments is unnecessarily hard.

There is indeed just convention. The language defines string constants similar to what you say[1] (an array of characters, terminated by a null character), but in the language itself there's no way to declare that a function takes a string rather than a pointer to a character. Alternatively if you work with a fixed-sized character array, there's nothing separating it from "just" an array of characters that are not null terminated.

So that strcmp expects a string rather than a pointer to a character is just convention. In languages which actually has strings as a concrete concept, like say Pascal and derivatives, you can actually differentiate between those two cases.

[1]: https://www.gnu.org/software/gnu-c-manual/gnu-c-manual.html#...


That means that the strings aren't properly reflected in the type system. But the existence of string literals with a very definite in-memory layout means that it's not just a convention even so.

But you can't use those string literals in any way without relying on convention.

That was my point. Although you can, actually - since literals themselves are array-typed, you can sizeof them to get the character count without relying on null termination. It's even possible to get a non-null-terminated literal if the target array type is not large enough to fit null, e.g.:

   char s[3] = "foo"; // not null-terminated!

> character count

Byte count.


If you want to be pedantic, "an object declared as type char is large enough to store any member of the basic execution character set". It doesn't actually have to be a byte.

In practice, in C context, character == char == byte. Other concepts have to use different names to avoid confusion with the language spec.


In C a 'char' is a 'byte', if we're going to be extra pedantic.

> C doesn't have strings.

C has string literals though, and those bake a specific string representation into the language (of course libraries can use their own string representation, but those then need at least some conversion function from string literals).


String literals in C are simply a pointer to an array of bytes (generally speaking), and that's how they should be treated. Considering them as strings in the HLL sense is the entire issue here.

I don't think it's useful to pretend C doesn't have strings when it has string literals.

WUFFS doesn't have strings. That's what a language which doesn't have strings looks like, you can't write "Hello, world" in WUFFS because it involves Strings, which WUFFS doesn't have, and I/O, which WUFFS also doesn't have.

A pretence that C doesn't have strings because it lacks a concrete string type in the language itself also seems like you'd be claiming C++ doesn't have strings, Zig doesn't have strings, and Rust came pretty close to not having strings (for a while it was mooted to make Rust's str just a slice [u8] but today Rust does bless str as a distinct type even though e.g. &str and &[u8] aren't very different)


C++ before C++11 didn't. I've fixed errors in projects which were due to string literals not being std::string. After C++11 things are more murky due to user literals[1]. I'd lean towards saying the language still doesn't, but yeah, murky.

Zig and Rust I don't know enough about.

And I'm not pretending. C has string literals which are of a non-distinct type. You can't distinguish between a string literal and an array of characters. This is the crucial bit.

The result is that the standard library, and lots of other code, relies on convention alone to pass strings around. This has been and continues to be the source for countless serious bugs. The kind of bugs which are a total non-issue in languages which has strings.

[1]: https://en.cppreference.com/w/cpp/language/user_literal


There's also RapidString, though the original author seems to have disappeared off of the internet. Would be curious to see benchmarks against sds.

The best way to write C is to treat strings as memory locations with characters and nothing more. Every such memory location has an allocated size, either statically at compile-time (with literals and arrays), or dynamically with malloc and friends. Treat string operations as mere memory operations and don't imagine them to be something else. The str* functions from the standard library are just convenience helpers which are not adding "string" functionality whatsoever.
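
For instance, under that mindset a string copy is just a memory operation with an explicit size (a trivial sketch):

    #include <stdio.h>
    #include <string.h>

    int main(void)
    {
        const char src[] = "Hello";   // sizeof src == 6: five chars plus the nul
        char dst[sizeof src];
        memcpy(dst, src, sizeof src); // the "string copy" is a plain memory copy
        printf("%s\n", dst);
        return 0;
    }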

Maybe you could summarize this by saying that C strings are "strings of bytes", not "strings of characters".

It's better to say that, in C, characters are bytes. That's why coming at this from the modern perspective of "characters are the things on my screen" is always going to confuse you - C doesn't have bytes, only characters. C programmers (should) understand this the same way Lisp programmers understand that "CAR" and "CDR" refer to "first" and "rest" or Forth programmers understand that "the stack" is the data stack and has no relation to the call stack.

So a "char" is a byte, but a "char" is perhaps not a "character", except as a funny coincidence of jargon.

C strings are bad for sure. Consider them raw assembly. Instead of using them directly, get a decent string library ASAP and use it exclusively.

They are just arrays, like everything else in the language. If you don't want to manage plain arrays, better look for a different language.

You are free to make good use of strings as arrays. I've written tons of code including firmware for MCU so I think I'll keep to my own practices.

they are not "just" arrays, they are arrays with a null-terminated expectation. Most of the time. The lack of consistency and difficulty communicating the expectations is what is hair-pulling in C, and that's on top of the difficulty of communicating the difference between an array and a pointer in C.

I don't use null terminated strings. ptr+len struct everywhere. And when I need to call an API, like fopen, I make a temporary copy of that string + the null termination, do my work and then free it.

You can printf non-null terminated strings too. Check printf("%.*s", length, strptr).
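
A sketch of that make-a-temporary-copy pattern for nul-terminated APIs like fopen (the slice type and names are hypothetical):

    #include <stdio.h>
    #include <stdlib.h>
    #include <string.h>

    struct str { const char *ptr; size_t len; }; // non-owning, not nul-terminated

    FILE *open_slice(struct str path, const char *mode)
    {
        char *tmp = malloc(path.len + 1); // temporary nul-terminated copy
        if (!tmp) return NULL;
        memcpy(tmp, path.ptr, path.len);
        tmp[path.len] = '\0';
        FILE *f = fopen(tmp, mode);
        free(tmp);
        return f;
    }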


> You can printf non-null terminated strings too. Check printf("%.*s", length, strptr).

I haven't checked yet, but I'm about 90% confident that's UB. Is printf() guaranteed not to read to the end of the string when you give it a length?


Yes, it is. From the C11 spec:

> Characters from the array are written up to (but not including) the terminating null character. If the precision is specified, no more than that many bytes are written. If the precision is not specified or is greater than the size of the array, the array shall contain a null character.


Thanks! I guess I didn't realize expressio unius est exclusio alterius applies in the C standard :D

Won't read a single byte beyond the length you give it.

And this is standard practice in libraries for printing strings of predetermined length.


A long time ago my solution was ptr+len but I allocated 1 more byte so that if a string had to be given to libc, I could terminate it at that time. No need for a copy then.

BASIC strings in Windows -- you store the length in the 4 bytes before the pointer to the string and put a null terminator at the end.

https://learn.microsoft.com/en-us/previous-versions/windows/...
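
For the curious, the OLE automation helpers expose that layout without manual pointer arithmetic (a sketch; BSTRs hold wide characters, and this links against oleaut32):

    #include <windows.h>
    #include <oleauto.h>
    #include <stdio.h>

    int main(void)
    {
        BSTR s = SysAllocString(L"Hello");         // length prefix + chars + terminator
        printf("%u\n", (unsigned)SysStringLen(s)); // reads the length prefix: 5
        SysFreeString(s);
        return 0;
    }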


If you are using C and do some non-trivial work with strings you should either use a good library to handle strings or build your own.

It is not that difficult in practice.

The old C std lib is, in my opinion, outdated, obsolete and a very bad fit for complex string handling, especially on the memory management side.

In my own framework, the string management module is using a dedicated memory allocator and a "high level" string API with full UTF8 support from the start.

As a general rule, I think that the C std lib is the weakest part of the C language and it should only be used as a fallback.


Fair, though there's always the crossover point where you need to interact with the OS, 3rd party libraries, protocols, etc. It's not difficult to miss a spot where your utf8 string gets mangled, truncated, etc.

Yes, do not trust the OS, use the minimal API surface, and build your own toolkit, you won't have to do it often.

Is your framework open sourced?

Not 100% decided yet, but it is very probable that I will open source it.

> especially on the memory management side.

Libc string functions don't manage memory. They can be used no matter where your strings are stored. It is more of a choice between generality vs convenience in common cases.


A lot of them require the string to be null terminated rather than taking a length.

I think this is the main culprit of the libc string functions: you have to provide buffers to store results, and the responsibility of managing those individually can be annoying and bug prone, resulting in vulnerabilities.

Passing an allocator (like Zig) or a container (like in my framework) to anything that needs to allocate some memory to store a result is both explicit, low overhead and quite convenient in practice.


Note that they're annoying, bug prone, and vulnerable in 3rd-party memory managed libraries, too, but you can just say it's someone else's problem then.

You cannot e.g. store a string as a slice of another string (unless the slice reaches the end).

> the C std lib is the weakest part of the C language and it should only be used as a fallback.

I've been musing for a while now: what would it look like if we were to discard the C library and design a new one, leaving the language itself intact?


I think it could be very nice.

C is not perfect; there are some parts of the syntax that I strongly dislike, like casts or function pointer declarations...

But it is overall a good enough syntax, much simpler than C++.


Amending the syntax is fun but rapidly becomes a slippery slope; soon enough you find yourself designing a new successor language, as has been done many times before. Simply scrapping the mostly-unhelpful C stdlib and inventing new, modern abstractions for allocation, IO, text, threading, etc seems like a more tractable problem.

I fully agree.

It has the same fundamental problem, though: you have to rewrite most existing code, which hinders adoption. In this case, it might actually hinder it more than also improving the language itself, since people would be more willing to take that leap if there are more benefits to be had from it.

Stdlib is probably the most successful library in history; not sure how it is “unhelpful”.

There are several libraries or projects where people have done exactly that.

You often end up with some kind of structure, or variations of structures, for strings:

    struct string {
      size_t length;
      char data[];     // flexible array member: the bytes live inline after the header
    };

    struct string {
      size_t length;   // bytes in use
      size_t alloc;    // bytes allocated
      char *data;      // separately allocated buffer
    };
Those are just examples. The tricky part is figuring out the different ownership use cases you want to solve. Because C gives you so much freedom and very little in the standard library, you end up with a lot of variations. You might use reference-counted strings, owned buffers, or string slices, etc. You might want certain types to be distinguished at compile-time and other types to be distinguished at run-time.

An example can be found in the Git source code.

https://github.com/git/git/blob/master/strbuf.h

The history of changes to this file is interesting as well. This is a relatively nice general-purpose string type—you can easily append to it or truncate it.


IMHO that does not solve the main problem, that is individual lifetime management.

I've seen many libs using this style of strings, not convinced by the practicality.


What is "individual lifetime management"?

It sounds like you’re rephrasing part of my comment back to me, or maybe I’m misinterpreting what you’re saying.

If you’re not convinced of the practicality, it sounds like you are simply not convinced of the practicality of doing string processing in C at all, which is a fair view point. String processing in C is somewhat a minefield. Libraries like Git’s strbuf are very effective relative to other solutions in C, but lack safety relative to other languages.


No, I simply am using a different approach, still in C, where strings are simple char*, null-terminated, nothing hidden with magic fields above the base address of the string.

The trick is to pass an allocator (or container) to string handling functions.

If/when I want to get rid of all the garbage I reset the container/allocator.


Yeah, you should have just said that in the first place.

I’ve seen similar approaches, e.g. with APR pools, and if your application can work within those restrictions, it’s very convenient.


You can backport Rust standard library to C using https://github.com/eqrion/cbindgen .

The old MacOS (pre-X) did just that. Strings were all "Pascal strings", ie. with the first byte containing the length of the actual string.

Building blocks for memory were also very different from stdlib, notably the use of Handles, which were pointers of pointers, so that the OS could move a block of data around to defragment the heap behind your back without breaking the memory addressing.


Pascal strings are also kind of bad though. All sub-string operations need allocation, or have to be defined with intermediate results which aren't "really" strings, so in that sense it's not an improvement on Zero-terminated strings. Equality tests are cheaper which is nice, since strings of different lengths compare unequal immediately, but most things aren't really improved.

C++ string_view is closer to the Right Thing™ - a slice, but C++ doesn't (yet) define anywhere what the encoding is, so... that's not what it could be. Rust's str is a slice and it's defined as UTF-8 encoded.


D's strings were defined to be UTF-8 back in 2000. wstring is UTF-16, and dstring is UTF-32.

Back then it wasn't clear which encoding method would turn out to be dominant, so we did all three. (Java was built on UTF-16.)

As it eventually became clear, UTF-8 is da winnah, and the other formats are sideshows. Windows, which uses UTF-16, is handled by converting UTF-8 to -16 just before calling a Windows function, and converting anything coming back to UTF-8.

D doesn't distinguish between a string and a string view.


What’s the ownership story for string views?

They don't own anything. It's just a pointer and length. They don't allocate/deallocate.

I mean clearly something needs to own the buffer for a new string.

Sure, but that's not the string_view's problem, you can't just make string_views, the string you want to borrow a view into needs to exist first.

Imagine you go to a library and insist on borrowing "My Cousin Rachel", but they don't have it. "Oh I don't care whether you have the book, I just want to borrow it" is clearly nonsense. If they don't have it, you can't borrow it.


Walter is talking about D, and he said this:

> D doesn't distinguish between a string and a string view.

In C++ std::string owns the buffer and std::string_view borrows it. If there is no difference between the two in D, then how is this difference bridged?


You can use automatic memory management and not worry about it. Or you can use D's prototype ownership/borrowing system. Or you can encapsulate them in something that manages the memory. Or you can do ownership/borrowing by convention (it's not hard to do).

Automatic memory management makes copies?

No. Another word for automatic memory management is garbage collection.

I guess I should rephrase. Let's say I have a string, which owns its buffer. What happens in D if I take a substring of it? Does a copy of that section occur to form a new string?

A lot of people don't know about this but Microsoft is taking steps to move everything over to utf-8.

They added a setting in Windows 10 to switch the code page over to utf-8 and then in Windows 11 they made it on by default. Individual applications can turn it on for themselves so they don't need to rely on the system setting being checked.

With that you can, in theory, just use the -A variants of the winapi with utf-8 strings. I haven't tried it out yet as we still support prior Windows releases but it's nice that Microsoft has found a way out from the utf-16 mess.


The A-variants had problems years ago, which is why D abandoned them in favor of the W versions.

I don't mind seeing UTF-16 fade away. We've been considering scaling back the D support for UTF-16/32 in the runtime library, in favor of just using converters as necessary. We recommend using UTF-8 as much as practical.


> with the first byte containing the length of the actual string

And the wheels fall off with the first string longer than 255 characters.


Which is why Free Pascal strings are so awesome. I've personally stuffed a billion bytes into one without issues. They are automatically reference counted, and as close to magic as you can get. You can return one from a function without issue.

However, Free Pascal has the worst documentation of any major project I've ever encountered (The exact opposite of Turbo Pascal), so I can't link to a good reference. Their Wiki is a black hole of nuance and sucks all useful stuff off the internet.


You can fix this issue by using a variable width integer encoding for the size.

It might fix that particular issue, but you still have the same problem that NUL terminated strings have: it's not possible to cheaply create views/slices of a string using the same type.

I remember that era well! During the first few years I used C, I never touched its standard library at all, using the Mac Toolbox instead. This was a common practice, which later carried over into C++.

The creator of this library (antirez) is a regular here on hn.

I believe this is used by Redis.

https://github.com/antirez/sds


Glib answer (but also relevant because mentioned in the article, too): it would look a lot like how a lot of people write C++.

>Glib answer

A Freudian slip, methinks.


How so? First definition I find of glib is "(of words or the person speaking them) fluent and voluble but insincere and shallow", which is mostly what I meant about that answer. There was some sincerity in my answer, but certainly somewhere in the border space of irony and sarcasm, which many people do take as insincerity.


The problem is that it's just too tempting to write something like

    my_function(my_var, 3.6, "bzarflo", my_other_var, false);
The string handling functions are part of the story, but the null-terminated char * is produced when the compiler reaches a string literal, and writing code without being allowed to just use string literals when it's convenient tends to feel like coding with oven mitts on.

It's entirely possible to write a wrapper function with a short name to convert string literals to actual string objects.

    my_function(my_var, 3.6, $("bzarflo"), my_other_var, false);
It isn't that much more of a mouthful, and as long as 'my_function' knows to free it, you're A-OK! The only trouble is '$()' isn't legal in standard C, so a real solution would have to be something like 'str()'.
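
A sketch of what such a helper could look like (the string type and the convention that the callee frees are hypothetical):

    #include <stdlib.h>
    #include <string.h>

    struct string { size_t len; char *data; }; // hypothetical string object

    // Wrap a literal (or any C string) in a heap-allocated string object.
    struct string *str(const char *lit)
    {
        struct string *s = malloc(sizeof *s);
        if (!s) return NULL;
        s->len = strlen(lit);
        s->data = malloc(s->len + 1);
        if (!s->data) { free(s); return NULL; }
        memcpy(s->data, lit, s->len + 1);
        return s;
    }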

This caught my eye the other day & looks quite promising, though I haven't spent much time looking at it so I can't comment on its memory safety:

https://github.com/tylov/STC


I'm in the WG14 and my opinion is that there isn't one good way to do strings; it all depends on what you value (performance / memory use) and the usage pattern. C in general only deals with data types, not their semantic meaning. (IOW, we say what a float is, not what it is used for.) The two main deviations from that are text and time, and both of them are causing us a lot of issues. My opinion is that writing your own text code is the best solution and the most "C" solution. The one proposal I have heard that I like is for C to get versions of functions that use strings that take an array and a length, so as to not force the convention of null termination in order to use things like fopen.

Has WG14 considered adding slices to C? [1] Introducing slices would naturally give way to a better string library.

[1] https://www.digitalmars.com/articles/C-biggest-mistake.html


It's been 50 years, so pretty much everything has been considered. In my opinion the mistake was not having arrays decay into pointers, but rather arrays should have been pointers in the first place. An array should be seen as a number of values with a pointer pointing at the first one. I think adding a third version of the same functionality would just complicate things further. (&p[42] is a "slice" of an array.) Another thing I do not like about slices that store lengths is that they hide memory layout from the user, and that is not a very C thing to do.

If you think about arrays as pointers, you will get a lot of things wrong, e.g.

    float m[10][10];

is not an array of pointers, but a two-dimensional array with a contiguous 2D memory layout.


You are right, sizeof is the other big difference. I think these differences are small enough that it was a mistake to separate the two. The similarities and differences do make them confusing.

How would you express a 2D memory layout with only pointers?

An array of pointers to arrays? Basically, a `T**`. C#'s "jagged" arrays are like this, and to get a "true" 2D array, you use different syntax (a comma in the indexer):

    int[][] jagged; // an array of `int[]` (i.e. each element is a reference to an `int[]`)
    int[,] multidimensional; // a "true" 2D array laid out in memory sequentially

    // allocate the jagged array; each `int[]` will be null until allocated separately
    jagged = new int[10][];
    Debug.Assert(jagged.All(elem => elem == null));
    for (int i = 0; i < 10; i++)
        jagged[i] = new int[10]; // allocate the inner arrays
    Debug.Assert(jagged[0][0] == 0);

    // allocate the multidimensional array; each `int` will be `default`, which is 0
    // element [i,j] will be at offset `10*i + j`
    multidimensional = new int[10, 10];
    Debug.Assert(multidimensional[0, 0] == 0);

Yes, this is what people with pre-C99 compilers that do not support variably modified types sometimes do. It is horrible (although there are some use cases).

I plan to bring such a proposal forward for the next version. Note that C already has everything to do this without much overhead, e.g. in C23 you can write:

  int N = 10;
  char buf[N] = { };
  auto x = &buf;
and 'x' has a slice type that automatically remembers the size. This works today with GCC / clang (with extensions or C2X language mode: https://godbolt.org/z/cMbM57r46 ).

We simply can not name it without referring to N and we can also not use it in structs (ouch).


You know what i think about auto :-)

How is this not a quality of implementation issue? Any implementation is free to track all sizes as much as they want with the current standard.

Either an implementation is forced to issue an error at run time if there is an out-of-bounds read/write, in which case it's a very different language than C, or the as-if rule lets any implementation ignore the feature.


Tracking sizes for purposes of bounds checking is QoI and I think this is perfectly fine. But here we can also recover the size with sizeof, so it is also required for compliance:

https://godbolt.org/z/qh7P93Tcd

And I agree that this is a misuse of auto. I only used it here to show that the type we miss already exists inside the C compiler; we can currently name it only by constructing it again:

char (*buf)[N] = ...

but we could simply allow

char (*buf)[:] =

and be done (as suggested by Dennis Ritchie: https://www.bell-labs.com/usr/dmr/www/vararray.pdf)


GLib is an alternative to the standard library.

> The old C std lib is, in my opinion, outdated, obsolete

...and has been since most of us ever used C.

I think one of the major failings of C was the lack of a good standard library that updated with the times.

Actually, I believe a rich standard toolbox was one of the best features of python, and helped with its success.


Which is why most applications reach back into POSIX when available, not that it fixes the security issues with the standard library.

Which parts of posix? What can I #include in a posix environment to get better string handling in C?

(Or maybe I’m misinterpreting your comment?)


You are misinterpreting my comment.

First part of my comment relates to C library in general, second part of my comment refers to strings and arrays, even if not explicitly.


"If you are using C and do some non-trivial work with strings you should either use a good library to handle strings or build your own."

   unsigned int str_len(const char *s)
   {
     register const char *t;
   
     t = s;
     for (;;) {
       if (!*t) return t - s; ++t;
       if (!*t) return t - s; ++t;
       if (!*t) return t - s; ++t;
       if (!*t) return t - s; ++t;
     }
   }
I still use this instead of stdlib strlen. Of course I also use software everyday that I know uses stdlib strlen. For most C programs dealing with strings I just use flex and yyleng, which in turn uses the stdlib strlen. Using flex for small jobs is overkill but it's quick and convenient. I am a hobbyist programmer; I write so-called "trivial" programs.

That said, this exact function is used in some "non-trivial" software written by someone else and that person is IMHO a better C programmer than any HN commenter I have seen, most of whom do not let the public see the code they write anyway. Go figure.

NB. I am not the author; this is in the public domain. The author is djb.


I think this naming style should be considered obsolete.

- This function will return the number of bytes, not of characters or codepoints.

- str and len are both abbreviations; we should use full words when possible.

- We can also be more explicit about what the function does: it does not simply return the string length, it counts characters (or bytes in this case).

Here is how I would name it:

    u32 CountBytesInString(char* string);
    u32 CountCharactersInString(char* string);

And on the implementation side, this work can be done with SIMD instructions, and be really freaking fast, but still, it should be explicit for the user that the work is O(n) complexity, not exactly free.


"- str and len are both abbreviations, we should use full words when possible -"

u32 and char are abbreviations.


Yes, this is true. And this is a tradeoff, I think that basic types are so widely used that we can use this style of abbreviation without much ambiguity.

strlen is also pretty unambiguous, but I still have to check what strstr means.


C isn't Java. Even Niklaus Wirth in Pascal, Oberon, and the like avoided naming their identifiers too long. 'GetStrSz()' is enough to achieve (most of) what you want, assuming certain naming conventions:

- Makes it clear that this returns the number of bytes, assuming a naming convention where `sz` refers to size (in bytes) and `ln` refers to length (in some other unit which would be specified in the type). Note that in C, 'characters' refers to bytes. It's a flaw in how C names its types, yes, but I wouldn't say it should be any different just because other languages do things differently.

- It doesn't use full words because I don't think it needs to. Abbreviations are OK as long as every (invested) party agrees that they're sane, and I think they're pretty sane.

- It makes it explicit that it is performing a calculation (hence, is O(n)) via 'get'.

I don't think all this is necessary, though - I actually think 'strln()' is enough. First, because characters means bytes, I can assume that this function is getting the number of characters (bytes) in a string. I wouldn't expect it to give me anything else! Second, in C, if strings were a struct of some sort, I'd expect to be able to get their length via 'str->ln', which would be O(1). The fact that the length is found through a function in the first place signals to me that it's doing something behind the scenes to figure that out. Remember - that's just my opinion, which I admit is extreme - but I think yours is just as extreme.


Naming is extremely important, and while strlen is a very basic and hardly ambiguous example, consistency is key and I believe that good naming rules should be applied globally or at least at the framework level.

I think that full words and verbs are easier to read and avoid ambiguity.

I guess this is a matter of style and preference.

This anecdote reminds me of the Mutazt type, something I found in a new codebase I was asked to debug. I had to dig for almost an hour to find exactly what this type was.

Turns out it was a char*, a C string. Buried under 4-5 levels of abstractions.

Mutazt = Mutable ASCII Zero Terminated.


Out of curiosity, why do you use this? I expect the builtin strlen() to be even more optimized than this. There's a lot more you can do than a simple loop unrolling.

Especially when there's a decent chance the compiler would replace the loop with a call to strlen.

Don't do this. libc string functions are usually done with hand tweaked assembly or as a builtin by the compiler. They will be faster than this.

This sort of practice is very outdated. The last time it made sense for performance reasons was probably the early 90s or earlier.

Additionally, strlen is not one of the C string functions you want to replace due to defects, the same way you would want to do with crusty old strcpy. If you're working with C strings there is nothing wrong with strlen. (Just don't call it redundantly in a loop body ...)


I've written my own strlen equivalent and benchmarked them against default on different compilers, processors and environments, and they almost always are faster or the same speed.

Default libs are sometimes very optimized but very often they are not, unfortunately.

If you care about performance, you should not rely blindly on the defaults.

A long time ago I wondered about the performance of memcpy on the Nintendo DS, for sure they would have provided a hand optimized version? And yes, it was handcrafted ARM assembly code, but my own version turned out to be twice as fast.

They simply forgot to use a simple prefetching trick in their implem.


Nintendo DS has probably had a lot less scrutiny than a major libc or recent GCC or clang [though you can probably target its ARM processor with that]. Also, for an older embedded platform they may choose to do optimization for code size rather than cycles or clock time.

I'm going to have to doubt the start of your comment. Having seen a lot of libc implementations I think you are better off not wasting time optimizing strlen. Also memcpy, probably memcpy moreso. Most memcpy()s I've seen in the current century are using SIMD instructions and the like. And compilers don't even bother emitting a call to libc for it anymore, they do it as a builtin.


On the contrary, I expected the Nintendo DS SDK to be well optimized, performance of memcpy can be critical on such a constrained hardware. And it was optimized, just not with the best tricks.

I got the prefetching trick from Intel source code, except that I replaced the PLD instruction by a simple dummy load.

And about strlen, you'd be surprised; some implems are very good, and some are not, depending on the compiler and the library. I've run benchmarks, I was surprised too.

To be honest, I don't really need super fast strlen, but I was curious and also learning to write fast SIMD code, basic string handling is a nice exercise.


I think this expectation doesn't vibe with my understanding of how people used to think about embedded or consoles. You shipped them and they were done. The games industry was also often trying to ship quickly. Small teams too. Latest tweaks to memcpy or fine tuning or revisiting the finer points of an already adequate SDK is low priority.

By contrast, many more people are updating optimizations to GCC or clang for arm, more frequently and over a longer timeframe.


You're probably right about consoles, and I was surprised to be wrong, but I checked, just to be sure.

GCC and Clang are very nice compilers, but they are a different thing than the std lib. glibc, musl, the Windows C Runtime, iOS, Android, all have different implementations, sometimes outdated.


Gcc and clang are relevant here because memcpy is a builtin. It will not call libc for this.

Same is true of MSVC.

It's been that way on most modern compilers for about 20 years.

That's why I'm saying rolling your own may be futile. Compilers, not just libc, have paid a lot of attention to getting those things fast.


What is true for memcpy (especially on small buffers) is not systematically true for string functions.

But testing always comes to the rescue, and these days we have Godbolt.


Any data to back up that FUD?

Of course I have data; do you think I am pulling benchmarks out of a hat?

But I have not published those benchmarks, if that is what you're asking. The Nintendo thing I am afraid I cannot easily reproduce, as I no longer have that devkit on hand.

As for the strlen benchmark, that is something I did a few years ago and would be easy to run again, but I am not sure it is worth the effort just to convince a random dude on the internet...


> do you think I am pulling benchmarks out of a hat?

Yes. Share the data or you're spreading FUD.


You're being silly. Nothing they said was FUD, it was just a (not particularly controversial) anecdote

Here is the musl stdlib strlen. I do not use glibc.

https://git.musl-libc.org/cgit/musl/plain/src/string/strlen....

    #include <string.h>
    #include <stdint.h>
    #include <limits.h>

    #define ALIGN (sizeof(size_t))
    #define ONES ((size_t)-1/UCHAR_MAX)
    #define HIGHS (ONES * (UCHAR_MAX/2+1))
    #define HASZERO(x) ((x)-ONES & ~(x) & HIGHS)

    size_t strlen(const char *s)
    {
        const char *a = s;
    #ifdef __GNUC__
        typedef size_t __attribute__((__may_alias__)) word;
        const word *w;
        for (; (uintptr_t)s % ALIGN; s++) if (!*s) return s-a;
        for (w = (const void *)s; !HASZERO(*w); w++);
        s = (const void *)w;
    #endif
        for (; *s; s++);
        return s-a;
    }


glibc strlen:

https://sourceware.org/git/?p=glibc.git;a=blob_plain;f=strin...

   #include <libc-pointer-arith.h>
   #include <string-fzb.h>
   #include <string-fzc.h>
   #include <string-fzi.h>
   #include <string-shift.h>
   #include <string.h>
   
   #ifdef STRLEN
   # define __strlen STRLEN
   #endif
   
   /* Return the length of the null-terminated string STR.  Scan for
      the null terminator quickly by testing four bytes at a time.  */
   size_t
   __strlen (const char *str)
   {
     /* Align pointer to sizeof op_t.  */
     const uintptr_t s_int = (uintptr_t) str;
     const op_t *word_ptr = (const op_t*) PTR_ALIGN_DOWN (str, sizeof (op_t));
   
     op_t word = *word_ptr;
     find_t mask = shift_find (find_zero_all (word), s_int);
     if (mask != 0)
       return index_first (mask);
   
     do
       word = *++word_ptr;
     while (! has_zero (word));
   
     return ((const char *) word_ptr) + index_first_zero (word) - str;
   }
   #ifndef STRLEN
   weak_alias (__strlen, strlen)
   libc_hidden_builtin_def (strlen)
   #endif
   
NetBSD common strlen:

https://ftp.netbsd.org/pub/NetBSD/NetBSD-current/src/common/...

    size_t
    strlen(const char *str)
    {
        const char *s;

        for (s = str; *s; ++s)
            continue;
        return (s - str);
    }
Apple strlen:

https://opensource.apple.com/source/Libc/Libc-1244.50.9/stri...

Apple strlen comes from FreeBSD. Until 2009, FreeBSD used an unoptimised strlen.

https://svnweb.FreeBSD.org/base/head/lib/libc/string/strlen....

   size_t
   strlen(str)
           const char *str;
   {
           const char *s;
   
           for (s = str; *s; ++s);
           return(s - str);
   }

FreeBSD eventually copied^1 NetBSD's x86_64 strlen.

1. "modeled after", "inspired by", etc.

https://svnweb.FreeBSD.org/base?view=revision&revision=18770...

   #include <sys/cdefs.h>
   __FBSDID("$FreeBSD$");
   
   #include <sys/limits.h>
   #include <sys/types.h>
   #include <string.h>
   
   /*
    * Portable strlen() for 32-bit and 64-bit systems.
    *
    * Rationale: it is generally much more efficient to do word length
    * operations and avoid branches on modern computer systems, as
    * compared to byte-length operations with a lot of branches.
    *
    * The expression:
    *
    *      ((x - 0x01....01) & ~x & 0x80....80)
    *
    * would evaluate to a non-zero value iff any of the bytes in the
    * original word is zero.  However, we can further reduce ~1/3 of
    * time if we consider that strlen() usually operate on 7-bit ASCII
    * by employing the following expression, which allows false positive
    * when high bit of 1 and use the tail case to catch these case:
    *
    *      ((x - 0x01....01) & 0x80....80)
    *
    * This is more than 5.2 times as compared to the raw implementation
    * on Intel T7300 under EM64T mode for strings longer than word length.
    */
   
   /* Magic numbers for the algorithm */
   #if LONG_BIT == 32
   static const unsigned long mask01 = 0x01010101;
   static const unsigned long mask80 = 0x80808080;
   #elif LONG_BIT == 64
   static const unsigned long mask01 = 0x0101010101010101;
   static const unsigned long mask80 = 0x8080808080808080;
   #else
   #error Unsupported word size
   #endif
   
   #define LONGPTR_MASK (sizeof(long) - 1)
   
   /*
    * Helper macro to return string length if we caught the zero
    * byte.
    */
   #define testbyte(x)                             \
           do {                                    \
                   if (p[x] == '\0')               \
                       return (p - str + x);       \
           } while (0)
   
   size_t
   strlen(const char *str)
   {
           const char *p;
           const unsigned long *lp;
   
           /* Skip the first few bytes until we have an aligned p */
           for (p = str; (uintptr_t)p & LONGPTR_MASK; p++)
               if (*p == '\0')
                   return (p - str);
   
           /* Scan the rest of the string using word sized operation */
           for (lp = (const unsigned long *)p; ; lp++)
               if ((*lp - mask01) & mask80) {
                   p = (const char *)(lp);
                   testbyte(0);
                   testbyte(1);
                   testbyte(2);
                   testbyte(3);
   #if (LONG_BIT >= 64)
                   testbyte(4);
                   testbyte(5);
                   testbyte(6);
                   testbyte(7);
   #endif
               }
   
           /* NOTREACHED */
           return 0;
   }

Something worth considering: Sometimes people may use older versions of compilers, e.g., older versions of GCC, for compiling older programs for older hardware. These GCC versions are smaller in size and written to run on less powerful hardware. For example, the gzip'd source tarball for GCC 2.95 from 2001 is 12M while the one for GCC 12.2 from 2022 is 143M, an 11x size increase.

Do you have any recommendations for good open source C standard library replacements instead of rolling your own string manipulation functions?

> use a good library

Such as?


> Our last function is strcmp. It looks at two strings and determines whether they are equal to each other or not. If they are it returns 0. If they aren’t it returns 1.

No it doesn’t.

    RETURN VALUES
         The strcmp() and strncmp() functions return an integer greater than, equal
         to, or less than 0, according as the string s1 is greater than, equal to,
         or less than the string s2.  The comparison is done using unsigned
         characters, so that ‘\200’ is greater than ‘\0’.

I've added a footnote to my incorrect explanation and credited you. I'm still a C noob so thank you for pointing this out!

Reminds me of the incorrect cast of memcmp() return value that resulted in this bug: https://bugs.mysql.com/bug.php?id=64884

I don't have macOS to prove it, but I believe `strcmp` on macOS returns either 0, 1 or -1


strcmp's return value is loosely defined because some implementations return the difference between the characters in the string to avoid some conditional checks or jumps. Something like:

  int d = (unsigned char)a[i] - (unsigned char)b[i];
  if (d != 0) return d;

It currently returns a character difference. Don’t rely on it, though!
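
A full loop in that style might look like this (an illustrative sketch, not any particular libc's code):

    #include <stddef.h>

    /* Returns the difference of the first mismatching bytes, compared as
       unsigned chars per the standard; 0 if the strings are equal. */
    int my_strcmp(const char *a, const char *b)
    {
        size_t i = 0;
        for (;;) {
            int d = (unsigned char)a[i] - (unsigned char)b[i];
            if (d != 0 || a[i] == '\0')
                return d;   /* mismatch found, or both strings ended */
            i++;
        }
    }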

Just a pedantic comment, but ありがとう is arigatou or roughly "thanks", not "hello". Hello would usually be こんにちは or, more confusingly, どうも

ありがとう

Sort of unfortunate, because there's really no good translation for "hello" into Japanese - you'd say おはよう in the morning, in the afternoon こんにちは and もしもし when answering the phone...

you'd say ohayogozaimasu in the morning, konnichiwa in the afternoon and konbanwa in the evening.

I don't know how to type japanese on my phone, but the first is literally "early" surrounded by honorifics. The characters for konnichiwa means "this/now", "day" and the "wa" at the end is an article making the previous phrase the subject of the sentence. Same with konbanwa, but for evening instead of day.

no idea on the etymology of moshimoshi for answering the phone, though.


According to an article I read, moshimoshi came from the telephone operators saying 申し申し (moushi moushi) to signify "I'm going to start speaking now". On a tangential note, 申し used to be the phrase when calling out to someone to ask something (similar to English "Excuse me").

The worst part of C strings is that they tend to show up in APIs (especially system calls). This makes interoperability with other languages harder than it should be.

This is why I hate them too. You can use a custom length + pointer type for representing strings in your own code, but interfacing with other libraries and the OS almost always requires having a null-terminated string. It forces you to make copies just to tack on the null terminator.

`strlcpy` is the function you probably want, but again it is not standard. https://lwn.net/Articles/507319/

I think the reason people don't want to standardise this kind of function is that it often gives wrong behaviour. For example, if you are trying to copy a string into a fixed buffer and it's too long, then it is often an error, or potentially even a security bug, to truncate it. So these functions generally do the 'wrong' thing even though they are 'safer'. If you are dealing with static buffers then I think you should be explicitly checking that the source fits in the target and then handling the error case. You could even have a function like `strlcpy` that does `strlen`, then checks if it fits, then does the copy or returns an error code (see the sketch below). Alternatively, if the string should always fit and you don't want to handle the error case, then the safe thing to do is check at runtime that it fits and abort the program if it doesn't.
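
That checked copy might look something like this (a sketch; the name and error convention are made up):

    #include <string.h>

    /* Hypothetical helper: refuse to truncate, report failure instead. */
    int str_copy_checked(char *dst, size_t dst_size, const char *src)
    {
        size_t len = strlen(src);
        if (len + 1 > dst_size)
            return -1;              /* does not fit; caller handles the error */
        memcpy(dst, src, len + 1);  /* +1 copies the null terminator too */
        return 0;
    }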


strlcpy is not needed; strcpy_s (not strncpy_s) is safe and is part of the C11 standard.

In fact, strlcpy is worse:

* strlcpy truncates the source string to fit in the destination (which is a security risk)

* strlcpy does not perform all the runtime checks that strcpy_s does

* strlcpy does not make failures obvious by setting the destination to a null string or calling a handler if the call fails.


> strcpy_s is part of the C11 standard

an optional part, which makes it pretty worthless, if it were not so already.


On systems that aren't memory constrained, we just shouldn't be using static buffers at all. Just always use something like asprintf() and free() the result when you're done. No, it's not in the C or POSIX standards, and that's a shame, but it's at least available on Linux and the BSDs.

I end up working on a lot of code that uses Glib, so I tend to use g_strdup_printf() a lot, which works the same as asprintf().

Ultimately the cost of allocations is usually not a big deal, and you gain a lot of safety. Sure, you then have to remember to free(), but I'll take a memory leak over a segfault (and its possible security consequences) any day.

And if allocation cost is a problem, you can always go back and optimize with static buffers later. That shouldn't be the default that people reach for, though.
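
For reference, usage looks like this (a minimal sketch; asprintf is a GNU/BSD extension, not ISO C, and make_greeting is a made-up name):

    #define _GNU_SOURCE     /* glibc needs this to declare asprintf */
    #include <stdio.h>

    char *make_greeting(const char *name)
    {
        char *s = NULL;
        if (asprintf(&s, "Hello, %s!", name) < 0)
            return NULL;    /* allocation failed */
        return s;           /* caller must free() the result */
    }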


No, it’s not. The return value it provides is generally unwanted.

For initial string input, i.e. from a network/file/terminal stream, using fgetc and/or fgets plus code to verify and sanitize makes the most sense IMO.

This does mean you have to write a lot of C code for what would be simple tasks in other languages, e.g. a correct file open, read-to-dynamically-allocated-memory, and file close with good error checking is a full page (at least) of dense code in C and just two lines in Python.

If you've done a good job sanitizing and verifying all the input to your program, only then does it become relatively safe to use the standard string functions, with caveats for multithreading.

Asking ChatGPT to compare and contrast fgetc and fgets is a good place to start, and then ask how to use fgets to handle errors during stream I/O, and what can go wrong with multithreading etc. Then take a look at the sqlite source code for in-house C-string handling, here's the take-away comment:

"Because there is no consistency, we will define our own."

https://github.com/sqlite/sqlite/blob/master/src/util.c


It's important to mention that strncpy (and also strncpy_s) is really not a strcpy replacement; it's not intended for the same usage. The name is a total misnomer. Do not use strncpy that way!

In any case, strcpy_s (which is a good replacement for strcpy) is part of the C11 standard. I'm confused how that isn't considered portable.


it's an _optional_ part of the standard, and so can't be relied on. also, the idea behind it is pretty poor.

Don't GCC, Clang and MSVC provide it? It may be optional in practice but if the major compilers support it, it's not really an issue.

The idea may not be perfect, but for C which is intended to be low overhead, strcpy_s is about as good as it gets. If you want something more user friendly, that is what C++ is for with std::string, or library implementations like Boost or QT string.


MSVC supports it, as it is an MS invention, designed by some intern, I guess. Other compilers may or may not, via switches/#defines. It is worthless in any case.

>as it is an MS invention

Source, and why does it matter who made it?

>designed by some intern

You don't know that, nor is that how standards work.

>other compilers may or may not

So you've said basically nothing.

>it is worthless in any case.

It offers a low-overhead, safer alternative to strcpy. Is it perfect? No. But it's one of the better C options for those limited to the standard library.


it was put forward to the standards committee by MS, who have a powerful voice there. no-one else wanted it, which is why it ended up in an annex of the standard. it is badly designed, and does nothing that you can't do yourself, and should be doing yourself, in any well-written code.

Yeah, I was a little confused by the strncpy "gotcha": "The answer is that the destination gets filled with all the characters of the source string with no room left for the null terminator."

Well, I mean, the docs specifically call that out: "If src is less than len characters long, the remainder of dst is filled with ‘\0’ characters. Otherwise, dst is not terminated."

So if you want null termination in all cases, you need to pass len-1, not len.


Programs which have to deal with C strings beyond the bare minimum that libc provides will generally have a set of routines for making it more ergonomic. e.g.:

https://github.com/git/git/blob/master/strbuf.h


This is from a C fan: If you are going to do any string heavy work, please use anything else than C (Python is pretty nice for this sort of stuff for instance).

And if you need to use C anyway, then please use anything else than the string functions from the standard library. The C stdlib is (mostly) a leftover from the K&R era when opinions about what makes a good API were very different from today, and C was a much 'harsher' language.

C is pretty nice for a lot of things, but working with strings definitely isn't one of them.


I would say, unless there's a performance reason not to, always use asprintf for every string operation.

For string heavy workload, C is ideal, provided that you don't use C string functions.

You can always allocate a very large buffer and do your string operations there, using memcpy and the assorted functions, which can be inlined on many architectures and be really fast.

Then you can dispose of the buffer really quickly with one call or reuse it for later operations by simply setting a few pointers to initial status...
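
Roughly this pattern, sketched with made-up names (a bump allocator over one big buffer, not a real library):

    #include <string.h>

    typedef struct { char *base; size_t used, cap; } arena_t;

    /* Append a copy of n bytes of s into the arena; returns its address. */
    char *arena_append(arena_t *a, const char *s, size_t n)
    {
        if (a->used + n + 1 > a->cap)
            return NULL;                 /* out of space */
        char *dst = a->base + a->used;
        memcpy(dst, s, n);               /* length is explicit, no strcpy */
        dst[n] = '\0';                   /* terminate for interop with C APIs */
        a->used += n + 1;
        return dst;
    }

Reuse is just `a->used = 0;`, and a single free(a->base) disposes of everything at once.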


If you have large data sets and need maximum performance I agree, but a lot of day to day string processing works on small data sets and isn't very performance-sensitive.

As a newcomer to C, why is it that the C standard library doesn't get updated? Newer languages seem to place a lot of emphasis on getting their standard libraries as useful as possible. It's odd to be told not to use the standard library functions but to write my own instead. I'm really doubting I can just sit down and hammer out string functions superior to string.h.

Just my guess: C doesn't depend as much on its standard library as other languages (which has its good and bad sides), and this also means that fixing the standard library isn't very high on the priority list of the C committee, because everybody knows that for any serious work the stdlib isn't suitable anyway.

The C stdlib is basically the lowest common denominator which enables writing very simple UNIX-style mostly-cross-platform command line tools, but not much else. For anything serious you either call OS API functions directly, or resort to specialized third-party libs.

Attempting to 'fix' the stdlib would first mean agreement on what such a stdlib should actually contain and look like, and this has a real risk of ending up in C++ Committee style busywork (e.g. lots of activity with little to show for it).


String handling in C has many gotchas indeed. Here are some of my notes on the subtleties:

https://www.pixelbeat.org/programming/gcc/string_buffers.htm...


> strcmp takes two strings and returns 0 when they are true.

ITYM “equal”, not “true”.



> So how can we handle this case safely? There are a few ways I can think of.

strdup:

> The strdup() function returns a pointer to a new string which is a duplicate of the string s. Memory for the new string is obtained with malloc(3), and can be freed with free(3).
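
Minimal usage (strdup is POSIX, and was finally standardized in C23):

    #include <string.h>
    #include <stdlib.h>

    void demo(const char *src)
    {
        char *copy = strdup(src);   /* heap-allocated duplicate of src */
        if (copy == NULL)
            return;                 /* allocation failed */
        /* ... modify copy freely; src is untouched ... */
        free(copy);
    }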


wchar_t is a massive landmine that should never be used since its size varies by platform. The locale of the compiler has to match the end user for L prefixed strings to work correctly. Likewise char16_t and char32_t are just swimming against the easy path at this point. You're much better off sticking to UTF-8 and using the C11 u8 prefix on literals so you can use the regular string API and never have to worry about locale settings.
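
For example (assuming a UTF-8 terminal; note that in C11/C17 a u8 literal is a plain char array):

    #include <stdio.h>

    int main(void)
    {
        /* u8"" guarantees the bytes are UTF-8 in the binary,
           independent of the compiler's execution character set. */
        const char *s = u8"日本語";
        printf("%s\n", s);
        return 0;
    }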

This is great advice! I wasn't aware of this and I will keep that in mind. When I first came across Unicode literals I was unsure when exactly you would use them over wchar_t

I once got called in to fix an SS7 stack suffering from poor performance. Pretty well written, and not obvious at first sight why it was going slow. Most of it was low-level bit fiddling, and some small strncpy()s - generally about 8 chars or so.

Didn't take that long to profile (well, printf's, as no profiling was available) and figure out it was the strncpy's causing the problem, but why? Well, there was a handy 8 megabyte buffer used for working memory that the strings were being copied into for modification.

From the strncpy() man page:-

>If the length of src is less than n, strncpy() pads the remainder of dest with null bytes.

Ah. So every little strncpy was essentially copying the string then zeroing out 7,999,992 bytes. And there were lots of little strncpy's...


The appearance of strncpy() in any source code is an immediate panic attack for me. It should never be used, and if it is used, it should be removed.

Similar rule for sprintf(), all instances of which should be replaced by snprintf().


Unfortunately there often isn't a better replacement in your standard library (embedded systems are weird). I ended up using strncpy followed by automatically setting the last byte of the string to null.

strncpy() in particular is so bad that you're better off (for a rare exception to the rule) just writing your own, that does what most people think strncpy() does (or should do) rather than what it actually does.

Wrap memccpy.
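
e.g. something like this (a sketch; the name is made up, and memccpy is POSIX, standardized in C23):

    #include <string.h>

    /* Bounded copy that always null-terminates; truncates if src doesn't fit. */
    char *copy_str(char *dst, const char *src, size_t dst_size)
    {
        if (dst_size == 0)
            return NULL;
        if (memccpy(dst, src, '\0', dst_size) == NULL)
            dst[dst_size - 1] = '\0';   /* no '\0' in the first dst_size bytes */
        return dst;
    }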

snprintf has very similar performance pitfalls.

no, because the size argument is only an upper bound on how many bytes can be written into the destination.

   snprintf (huge_buf, huge_buffer_size, "%d", 1);
will write two bytes into huge_buf, regardless of huge_buffer_size (assuming it is 2 or larger).

It's in the other direction: snprintf(small_buf, small_size, "%s", huge_string) will need to iterate the whole string.

why? snprintf() will just write as many bytes from huge_string as necessary, up to the smaller of small_size and strlen (huge_string).

what makes you believe it will iterate the whole of huge_string?


Quoth the standard:

> The snprintf function returns the number of characters that would have been written had n been sufficiently large, not counting the terminating null character, or a negative value if an encoding error occurred.

In essence it needs to return strlen of huge_string even though very little of it was actually written.


Well that's pretty fucked up. I note that the GNU C library docs say this:

> Attention: In versions of the GNU C Library prior to 2.1 the return value is the number of characters stored, not including the terminating null; unless there was not enough space in s to store the result in which case -1 is returned. This was changed in order to comply with the ISO C99 standard.

ISO C99 needs a kick in the head. Yes, there is a use case for this return value (buffer wasn't large enough, reallocate and try it again). But wow, own goal team!
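
For reference, that use case is the measure-allocate-format idiom (join_path is a made-up example name):

    #include <stdio.h>
    #include <stdlib.h>

    char *join_path(const char *dir, const char *file)
    {
        int n = snprintf(NULL, 0, "%s/%s", dir, file);  /* measure only */
        if (n < 0)
            return NULL;                                /* encoding error */
        char *path = malloc((size_t)n + 1);
        if (path != NULL)
            snprintf(path, (size_t)n + 1, "%s/%s", dir, file);
        return path;
    }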

Thanks for this. I had no idea that C99 had defined this so stupidly. I do see that in 2004, the linux kernel added replacements (scnprintf() which behave as the pre-C99 versions of snprintf generally did). There's a good discussion of this here: https://lwn.net/Articles/69419/


Literally 25 years ago I was a beginner programmer and tried writing a .dll for Microsoft's Internet Information Server, which was relatively new at the time. (I hadn't so much as seen a Unix-based OS at the time, let alone understood CGI). C strings were mind boggling and frustrated me so much I simply gave up. Happily around the same time, MS introduced Active Server Pages and I was able to use that and never messed with C again. It's amazing the same issues still exist decades later.

That is the most mind-boggling part of this saga to me. People have been using C since the 1970s. It's now 2023, and there still isn't an obvious solution to this other than suggestions that every team should write their own string library from scratch.

And apparently it all started with some genius deciding that using a single 0-byte at the end is so deliciously efficient and therefore obviously the way to go. We can't waste 4 bytes for the string length, that's out of the question. I think only the Pascal solution of having a single byte for the string length is worse.


> At first this looks great, but there is a problem. What happens when the source string minus the null terminator is as long as the size of the destination string? The answer is that the destination gets filled with all the characters of the source string with no room left for the null terminator.

The 'n' in strncpy is mainly there to help you avoid overrunning the destination; it does not guarantee that whatever makes it in there is null-terminated.

This is why you should always explicitly set the last byte to zero after using strncpy (and never ever use strcpy).

  char dest[16];

  strncpy(dest, src, 15);
  dest[15]=0;

Related and good read about strcpy in the kernel: https://lwn.net/Articles/905777

> "But for real if anyone knows how to get this to work on Windows 10 let me know!"

Since the May 2019 update, Windows 10 has supported declaring the code page in a manifest file.

In Visual Studio, you must add "/utf-8" to the compiler command line, this makes it parse the source code as a UTF-8 file, and makes it output UTF-8 string literals.

To make console output work, call the Win32 function "SetConsoleOutputCP(65001);"

To get support for opening files with names that aren't in your system codepage:

* Create a manifest file as shown in https://learn.microsoft.com/en-us/windows/apps/design/globalizing/use-utf8-code-page

* Add this as an "Additional Manifest File" in Visual Studio project settings for the manifest tool

Additionally, there is an undocumented NTDLL function "RtlInitNlsTables" that sets the code page for the process. It is difficult to use without a lot of example code, but some app locale type tools (used to change locale for a process) make use of this function.


Programming in Go made me a better C programmer, because now I no longer use C strings, only a buffer/length/capacity struct.
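
Something along these lines (illustrative, not Go's actual layout):

    #include <stddef.h>

    typedef struct {
        char  *data;   /* not necessarily null-terminated */
        size_t len;    /* bytes in use */
        size_t cap;    /* bytes allocated */
    } str_buf;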

Meh. The wchar_t stuff is barely C's fault. You use wide (fixed-width) characters, then set the terminal encoding to UTF-8 (a variable-length encoding). What did you expect? It's a Windows issue. I can copy-paste all sorts of UTF-8 in "normal" string literals, printf and puts them, and it just works in my terminal.

RE counting characters: this is a whole can of worms. Do you want to count grapheme clusters? Code points? Anything other than just the number of bytes? Use a Unicode library.

The latter part of this article is a bit like those articles that make fun of javascript for having floating point numbers behave like, gasp, floating point numbers.


As a cellist, I was about to sympathize when I read the title.

"We're not in Kansas any more, Toto"

Or to paraphrase that "We're not in Python any more, and C is not Python".

You know what sends me insane? Indentation and lack of fixed types in Python. But I don't have problems with C strings. Because I have grown to love and know C's string foibles just like the author will certainly not be driven insane by 'Python's shortcomings according to me'.

The world is full of people who complain that something or other is different from what they know, so that 'other' is wrong. That's just being isolationist. Everything has its own advantages, its own disadvantages. Let's accept that and move on, instead of making mountains out of mole-hills.


> Indentation and lack of fixed types in Python.

Whenever I see someone complain about Python's indentation, my brain internally translates it to "I poorly format my code."

If your code is properly formatted, then Python's indentation is never a problem. I praise Python's indentation-as-syntax because it prevents issues like a dangling else or a forgotten brace while also making proper formatting a requirement for your program to run.


Let's be clear about this: python indentation-as-syntax makes one particular style of formatting a requirement. Not everybody likes it. You might say I poorly format my code in other languages, but most of the time for readability you are somewhat stuck with the formatting of that code base which turns out to be a non-problem thanks to modern editors (sarcasm) like vi that can do brace matching and are older than python.

Python's rigidity on formatting should solve that problem, but it really doesn't and over time it relaxed the rules somewhat which made it better.


> then Python's indentation is never a problem

I'm sure you have some solid 3rd party research to cite to back up that absolutist claim. Or have some explanation of how it was my workmate's fault, and not a git merge (which mixed up indenting between the two files), that introduced a bug that took hours to track down. Or why people like me loved the idea of meaningful indentation in Python, then grew to not love it any more after experience.

Or the bitch that it can be when you have to generate code and have to do more than just slap braces around it to delimit blocks.

All in all, another one of those pure-opinion HN posts that are becoming too common.


You've missed the GP point by so wide a margin you could have as well shot in the opposite direction. You're basically doing exactly what they asked you not to.

People are different, work in different ways, are productive with different techniques, have their own habits formed over decades that they don't feel the need to change. None of that means they're worse than you, who, of course, always properly formats your code.

It's perfectly possible, and not that rare, to be careful with indentation and not liking significant whitespace (esp. if the rules are not overly consistent, like in Python). It's also possible to feel like the need to manually adjust indents after moving the code around is a distracting chore that braces largely eliminate. It's possible to work exclusively within an IDE that will make "dangling else" problem impossible to happen, therefore it's possible not to see that as a problem. And so on.

Keep an open mind. Try to get accustomed to various style of working with code. Stop your brain from being discriminatory, and don't assume too much about what other people consider "good" or "poor". If in doubt, ask politely.

BTW: I'm using Python since 2.5.2 professionally. Just a quick disclaimer.


The biggest issue is that Python's indentation stuff doesn't work in a function parameter list, so you get stuck with single-line lambdas only, or defining a temporary named function before using it, which is unnatural.

That’s a fair point, but of the over twenty programming languages I’ve used in my career, only C uses null terminated strings. All the others store the length. There’s good reasons for that. I think C strings are objectively bad and error prone.

Indentation and lack of fixed types aren't responsible for over 50% of known security issues in software.

C's issue are not just harmless foibles. They cause real harm to the poor people actually using the software.


> Indentation and lack of fixed types aren't responsible for over 50% of known security issues in software.

there is no way of fixing this with a "string type". The errors come from IPC/Internet, and there will always be just a sequence of bytes, and some length, maybe, given by the user. Somewhere some code will have to trust this length, or compute a length.


There is no practical advantage to null terminated strings though.

It's not that they are "different", it's that they are extremely error prone and have poor performance for certain operations such as getting the length.

>But I don't have problems with C strings

Everyone thinks they are clever enough to use them and other parts of C without problems, and those people are the most dangerous.


You've grown to love the footguns and hundreds of thousands of security holes that null-terminated strings have introduced over the decades?

It's not so much a question of different is bad, it's that having one of the six positions for your car's stick shift be marked 'Self-destruct' is... Sub-optimal. I'm sure you're smart enough to operate that car safely, but the ditches seem to be filled with burnt-out husks.

Tab-based, versus curly-brace indentation, on the other hand, is a question of how you want the car painted. Purely personal taste.


> It's not so much a question of different is bad, it's that having one of the six positions for your car's stick shift be marked 'Self-destruct' is... Sub-optimal. I'm sure you're smart enough to operate that car safely, but the ditches seem to be filled with burnt-out husks.

Good analogy, but I think you're being too forgiving to C. It's more like having all 6 out of the 6 positions of your car's stick shift marked as self-destruct. If you don't want the car to self-destruct while changing gears, you need to tune your FM radio to 99.0 MHz and quickly set your turn signals to left, right and left again before shifting. And that only works safely when shifting into the 1st, 2nd and 4th gears, unless you modify your car engine so it can only drive on Microsoft roads.


On a Python car, the car won't even start if you don't have your cosmetics right, and they have to be of a specific color.

I'm pretty sure that my C program either won't compile, or won't do what I want if I treat curly braces and semicolons as you are treating python indentation.

They serve the exact same purpose, and are both necessary, but the choice of braces versus whitespace is purely aesthetic.


K&R contains this beautiful koan-like string copy code:

    while (*t++ = *s++)
        ;
Honestly the elegance of this thing was one of the hooks that made me fall in love with C. But this was from a now-forgotten age of innocence, as there are so many "nopes" around this line-and-a-half that one would, rightly, be tarred and feathered for ever putting it in a program today.

Could you explain why this line should be discouraged? I'm a beginner in C, so I really don't know. That's why I'm asking.

while (*t++ = *s++) ;

You're assigning a char to another, relying on the return value being 0 to detect end of string.

You're performing the copy while also increasing the pointers with ++ in the same expression.

You're using the cryptic ; empty statement to signify nop, thereby confusing newbies.

Etc


It's the equivalent of a strcpy(): there's no bounds checking.

Another reason, in addition to other replies and separate from safety concerns, is that strcpy, memcpy etc are nowadays usually implemented via more efficient compiler intrinsics rather than an explicit loop.

> By default, Windows PowerShell .lnk shortcut is hardcoded to use the "Consolas" font

Surely this is not the case for Japanese versions of Windows (or users with Japanese set as their display language?)


Yes, you can actually look at the full list from `HKLM\Software\Microsoft\Windows NT\CurrentVersion\Console\TrueTypeFont` (taken from my copy of Windows 10, bracketed comments mine):

    0       Lucida Console
    00      Consolas
    932     *MS ゴシック [MS Gothic for Japanese]
    936     *新宋体 [Simsun for simplified Chinese]
    949     *굴림체 [Gulimche for Korean]
    950     *細明體 [Windows MingLiU for traditional Chinese]
Note that there are actually two global defaults which only are differentiated by leading zeros. This is intentional and can be used to enable additional fonts; it is a common tweak for Korean (and probably other CJK) users to add a preferred font with a name 0949 or 00949 etc.

Japanese versions use CP932 though, which according to your list would use a font that's not Consolas (MS Gothic in this case).

I should have said "no" in place of "yes" (being a non-native speaker, I completely overlooked "not" in your original reply), otherwise I believe I didn't contradict you.

How to make C safe: by putting it back in the box and putting the box back on the shelf and then closing the door to the garage. Then using a safe-by-design language.

It's strange that computer programmers think of themselves as being on the cutting edge of technology, but then we use a language that is over 50 years old. Of course there are going to be lots of problems with C strings since they were designed for a totally different world (no Unicode, no security issues, memory was precious, etc). The hardware is a million times as powerful but the software environment improves at a glacial pace.

Pop quiz, which of these is safe, given "char buf[80]" and arbitrary user input in argv[1]?

    gets(buf);
    scanf("%s", buf);
    strcpy(buf, argv[1]);
    scanf("%80s", buf);
    strncpy(buf, argv[1], 80);
    snprintf(buf, 80, argv[1]);
----

The delightful answer is none of them. The first three have no bounds checking at all, meaning that they will happily overflow the buffer to an arbitrary extent (gets, at least, will usually trigger a warning on modern compilers). The next two have off-by-one errors: scanf will write a NUL byte out of bounds (and that's exploitable! https://googleprojectzero.blogspot.com/2014/08/the-poisoned-...) while strncpy will fail to NUL-terminate the string. The last one uses the right buffer length, but treats user input as a format string and can leak memory contents or produce arbitrary memory corruption with the %n format specifier.

C string handling practically invites off-by-one errors and horrible security practices out-of-the-box.


Are you sure that strncpy does an out-of-bounds write here? I believe it doesn't, but it would give you an unterminated string in buf, which is also... less than ideal (if the input is 80 non-null characters or longer).

Ugh, I had that in my comment before I "refactored" it. Fixed now, thanks for pointing it out.

Yeah, and I wouldn't say it's definitely unsafe. You can memchr a '\0' out of it (or not) to determine if a null terminator got in there or not.

I mean, you could but a lot of people do not. Plus, strncpy is terribly inefficient if the source string is tiny, because it’ll fill the rest of the buffer with NULs.

One mans inefficiency is another mans resistance to timing attacks :P

(this is tongue-in-cheek, the function is still bad IMO because it almost never does what you need it to. If it guaranteed null termination it would be more useful)


This is a very thoughtful post. Real question: Is the solution to avoid C strings entirely? Use something like a Pascal string that includes the length?

Yes. For the love of god, a thousand times yes. Every other language, even some of the "C-compatible" ones uses explicit-length strings.

    snprintf(buf, 80, “%s”, argv[1]);
Should work.

As long as you replace the 'smart'-quotes with actual quotes.

-Emily


Bingo, you should never pass arbitrary strings where they could be used as format specifiers, it's like running arbitrary code. Some compilers even issue warnings when you pass non-literal format strings to the printf family.
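
The rule in one line; GCC and Clang can flag the dangerous form with -Wformat-security:

    #include <stdio.h>

    void echo(const char *user_input)
    {
        /* printf(user_input);   dangerous: input parsed as a format string */
        printf("%s", user_input);   /* safe: input treated purely as data */
    }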

> The delightful answer is none of them.

No. Sorry. This is bad programming. C'mon.

I started programming back in the 8080, 8085, 6502, etc. days. I had to program some prototype computers using a hex keypad while entering raw machine code (not even assembler). I still own a couple of these:

https://i.imgur.com/ZsIJj1p.png

In a couple of cases I had to take this approach to bootstrap Forth on a 6502, then write a full Forth code editor and finally write the robotics application from there.

Do not confuse bad programming or lack of knowledge with something attributable to a language, any language. A knowledgeable software developer, among other things, stays clear of these issues. This is also the value of experience and exposure to a wide range of technologies.

It's like blaming MicroPython for a machine getting destroyed because garbage collection interrupted a critical real-time process. There's nothing wrong with MicroPython in that regard; the programmer/designer of the embedded system simply lacked knowledge and understanding.

Part of the problem, as I see it, is that a good deal of modern university CS degrees don't even touch low level stuff. They start students on languages like Javascript and Python. These are fantastic, however, someone with deep-rooted experience in these languages who jumps into C is very likely to do some truly horrific things. The language isn't the problem, at all.

I mean, not to go too far, the Linux kernel is written in C. Right? It's about the person, not the language.


One could take a glance at these and easily believe that they do the right thing. I don't think anyone can be counted on never to miss such a small error from time to time.

Of course, and the more experience you have the less of this will happen.

I've been writing software in over a dozen languages for over 30 years. Generally speaking, when I write code, even complex code, in any language, it just works. Not because I am something special. I have done a lot of work across a wide range of application domains and have made my share of mistakes over the years.

Of course I make mistakes. Everyone does. Yet these mistakes have nothing to do with lack of domain knowledge. People who approach C without having a clue as to how memory, registers and the internals of a processor and memory system work are going to create bad code.

Blaming the language, the tools, is irrational. You can write perfectly good, safe and performant code in assembler. And boy, can assembler be a minefield in the hands of someone without experience!

Are we going to blame the processor microcode then? No, of course not.


So, there are no bad tools, no bad languages, ever? A chainsaw without hand guard is a fine tool, and the blame is on whoever got their own arm chopped?

I find that dubious, to say the least. Languages are not all equal, obviously. You can take an existing language and make it worse by removing some useful features or degrading existing ones; so why couldn't you make it better? In the case of C, its string handling has been proven time and time again to be a collection of footguns.


> A chainsaw without hand guard is a fine tool, and the blame is on whoever got their own arm chopped?

I think you are stretching it. Still, let's go with it.

I have done a ton of construction work in my life. From large projects at home ($200K-ish) to managing the build of a $12MM data center I designed. Because of this I have been around construction guys of all kinds and skill levels. And, of course, I have a lot of personal experience doing the work as well, from carpentry to just-about anything in a typical home or commercial project.

Anyhow, I always cringe when I see experienced construction guys work with modified tools that have had safeties removed to make the work go faster. One example of this was when I watched these guys cutting concrete blocks with a handheld grinder. They had removed the guard that typically covers half the blade. The entire blade was fully open and spinning at 10K+ RPM. When asked they said they'd been doing it this way for twenty years, it's faster, they can see the cut and control it far better. Still had all fingers.

Same is true of guys cutting framing lumber with circular saws or skilsaws while propping up the pieces with their bodies.

To me, someone with not even 10% of the experience they have, that was unthinkable. I would have lost fingers and limbs. I would have ended-up in the hospital almost instantly and possibly take others with me.

It's a relative term. Are the tools bad? Well, when experienced professionals can use them safely day in and day out (this is their job, they've been doing it this way every day for twenty years), can we really blame the tool if I grab it and proceed to remove a finger or three?

No. Of course not. I know the American system of liability doesn't work that way, but that would be and should be 100% my fault for not having the experience necessary to approach such a thing safely.

It's the same thing, it doesn't matter if we are talking about coding, CNC machining or downhill skiing. Newbies love to blame the skis for what they did wrong, or the $150K CNC machine for crashing the $10K spindle into the table. It's never their fault. Sure.


>Do not confuse bad programming or lack of knowledge with something attributable to a language, any language.

In C everyone starts out as a "bad programmer". In other languages people are merely inexperienced.

>however, someone with deep-rooted experience in these languages who jumps into C is very likely to do some truly horrific things. The language isn't the problem, at all.

You are contradicting yourself.


> In C everyone starts out as a "bad programmer"

In everything in life one starts out as a "bad <X>". I would be a bad free diver (I'd probably kill myself).

People inexperienced in <X> lack knowledge in <X>. That is not an insult. That's just reality. One can make some pretty serious mistakes as an inexperienced Python programmer (example: async/await) or downhill skier.

It's not the language, it's lack of knowledge and experience.

It's not the skis, it's lack of knowledge and experience.

Are we now in a culture where saying that someone is doing <X> badly because they don't have experience is an insult? OK, great. Let's blame everything else, except for lack of experience. C is the problem. Please don't use it.

It will be very interesting to watch as the "not my fault/blame everyone but me" crowd faces having to justify their lack of skills against what tools like ChatGPT will evolve into. Very interesting. I guess we'll blame LLMs for not knowing <X> well enough to be hired.


Funnily, GPT-4 seems like it generates pretty bad C code but pretty good Python code.

The C code will have silly things, like bad style or `double d = malloc(sizeof(double))` (instead of `double*`), which makes it evident that its training data was full of pretty bad C code. Which makes sense since most C code out there, like on StackOverflow, is bad. Same with Bash code.

The worse quality of code available in these langs suggests to me that these langs are inherently more difficult, which means people are more likely to be bad at them.

Whether they deserve blame for that, or whether it disqualifies them as legitimate technologies, is subjective. Objectively, though, you're accepting a higher rate of failure by using them over less difficult alternatives. If "good <X>" colloquially means "<X> with high likelihood of generating desired outcomes" and "bad <X>" means "<X> with high likelihood of generating undesired outcomes", I think it's fair to call both "bad langs". ;p


> Funnily, GPT-4 seems like it generates pretty bad C code but pretty good Python code.

Back when OpenAI had code-specific models based on GPT-n they were generally specifically advertised as best at Python; I suspect their coding-related training data and human feedback on coding tasks all favors Python by a significant amount (and I supect that that actually gets reinforced by positive feedback, since this makes it most likely that they get used with Python over time, too.)


:o Huh. I guess it's not a fair sampling of code out in the wild then.

> Funnily, GPT-4 seems like it generates pretty bad C code but pretty good Python code.

It's only a matter of time. And likely not a lot of time.

I asked ChatGPT to write a fast CRC-16 calculation algorithm in ARM assembler given a set of preferred registers and other constraints. I compared it to my own code, written a while back. Not too bad.

It wasn't clever about using assembler tricks experienced assembler coders understand, yet the code passed my test suite. My code was much faster because it was written with the benefit of experience that had me reaching for optimizations ChatGPT did not.

The interesting part was when I asked that it modify the code to work with a different buffer structure and be able to compute CRC-8, CRC-16 and CRC-32 with various modifiers.

It did it in just a few seconds. The code passed 100% of my tests. Not super fast or efficient, but it worked. I remember when I had to do that myself with my own code, it took over a day.

This is today, mid 2023. Give it a year or two (maybe less?) and it will be a tool to contend with. People who like to blame everything else rather than their lack of knowledge and experience will not do very well in that world.

Why would I pay someone to do <X> when they bring nothing special to the table?

Here's the huge paradigm shift (at least for me):

I could not care less what someone knows or does not know. I care about the range and breadth of their experience and how they approach learning that which they do not know.

Someone like that can use any available tool, including AI tools, to deliver value in almost any domain. Someone who blames others (tools, people, the system, whatever), cannot.

We might just be entering an era in which experience will be shown to have serious value.


> A knowledgeable software developer, among other things, stays clear of these issues.

This is the "no true scotsman" fallacy.

Languages can be designed so that less than perfectly knowledgeable programmers fall into the pit of success, or they can be designed so that they fall into the pit of failure.

For people making your argument, I like to provide this challenge: Go take a flight on a 737 MAX that hasn't had its MCAS fixed/disabled. That should be fine, right? After all, no "true" pilot ought to disregard one sentence on page 437 of the flight manual that they weren't even given during a 1 hour training video. A true professional pilot memorises the engineering blueprints, the source code of the avionics, and the wiring schematics, surely. So you have nothing to fear! The plane is "safe", and pilots can be trusted to be knowledgeable.

Go buy that ticket.


> This is the "no true scotsman" fallacy.

Sorry. Not even close. Source: I actually studied Philosophy/Logic at Uni. Good try though.

Also, your aircraft example is absolutely ridiculous.

This isn't an appeal to purity at all. This is about domain knowledge and experience.

A more appropriate example might be the contrast between someone who has only done 3D printing now deciding to design and make parts meant for CNC machining. The lack of expertise and understanding will result in some pretty serious problems.

Another example, this time about software development. I have over ten years of professional software development using Forth. Someone coming to Forth from, say, Python, is likely to make a mess until they understand how to approach problems in Forth. I also have about ten years of professional coding experience using APL. Same thing. Someone coming to APL from other languages is going to run into problems until they gain enough knowledge to write APL.


> I actually studied Phisolophy/Logic at Uni. Good try though.

Appeal to authority.

PS: I studied philosophy too.


Nope. Wrong again. You should probably take that class again.

I am telling you that I evaluated your claim, I didn’t google it or ask ChatGPT.


Argumentum ad nauseam.

Very funny. And also very wrong. Again.

God, this attitude reeeallllyy grinds my gears.

This is precisely why C has outstayed its welcome in so many areas of software development.

Every time some kid looking for a self-confidence boost buys into the idea that using a language with a minefield of archaically-named string manipulation functions somehow makes them a ‘real’, ‘smart’, developer, we are all left a little worse off.

No, it’s not the fault of the language’s design. It’s not even the fault of history - the fact that C was conceived at a time when security wasn’t what it is. It’s these damn kids that only know Python and JavaScript! Why can’t they be as smart as us C developers!

This is all completely ignoring the fact that in 2023 we have no shortage of string-manipulation-related vulnerabilities in widely popular and supposedly battle-tested C code. All some version of the typical list of completely justifiable human errors that anyone is bound to make writing C.

A language that is so popular but that so few people seem to be able to write secure code with, is not a very good language.

I’m immediately skeptical of anyone that’s not of the view that the single best thing we as an industry can do for security is to drastically reduce the amount of C code in circulation. It always comes down to “I’m set in my ways and I think I’m superhuman”.

My hope is that these modern, sensible systems programming languages successfully eat the world faster than the pool of C developers thins out, as people slowly retire, and more greenhorns clue into the fact that C is being used in more places than it ought to be.

Signed, someone that did learn C in school, and has written it professionally.


> God, this attitude reeeallllyy grinds my gears.

> ‘real’, ‘smart’, developer

It should not. And you are taking my comment completely out of context. 100% out of context. Violently out of context.

I have not even implied that this is about "real" or "smart" developers. C'mon!

This is about TWO things: Knowledge and experience. And that is IT. That's what I said.

So, pretty please, don't put words in my mouth and get all self-righteous about something you invented.

> Why can’t they be as smart as us C developers!

They can! All they have to do is learn and develop the experience base to use the tool correctly. Nobody is saying it can't be done. Again, don't put words where I did not use them.

Do you drive a car every day?

Yes?

Do you think you would do well if you got in the seat of a Formula 1 car?

Of course you would not. Because you lack knowledge and experience in the domain. You can learn. Of course you can learn. And that requires work and dedication.

Blaming the Formula 1 car for the lack of knowledge and experience of the driver is nothing less than ridiculous.


I did start with C at university, C and Scheme. We didn't go very in depth on the "defensive programming" part, especially with C. We talked a bit about safety when we were doing web and database stuff, but I think that's it. I am pretty lucky: one of my friends is in cybersecurity, another is very good with C, and there are lots of people online freely sharing their knowledge, so at least I kind of know what I don't know.

On the other hand, not everyone had that luck. I've seen a good number of people that are very good at what they do but lack more general culture. But it's hard to keep up with everything. Software is a huge world, I think already way too big for everyone to know everything. And it's not just software too, it's important to learn about business too, and maybe a bit of maths here and there isn't a bad idea, and there's also the hardware part, networking, and every day there is more and more and more.

What I mean by this is that I don't know how things were before, but today, for a lot of people that write code, it's not possible to know everything, have everything fit inside your head. In those cases, people usually start asking for more guardrails in their tools, because they're no longer manipulated only by experts. And sometimes the experts themselves ask for guardrails too. So some want tools to change, others don't want them to change, and both have a point.

On one hand, I understand that blaming the tool isn't a good attitude to have. On the other hand, my job consists in building tools for other professionals, and I feel like I have way higher standards for the tools that I produce compared to the tools that I use.


> On one hand, I understand that blaming the tool isn't a good attitude to have. On the other hand, my job consists in building tools for other professionals, and I feel like I have way higher standards for the tools that I produce compared to the tools that I use.

I think your view is of this is reasonably balanced. There is that element of someone without extensive experience not knowing what they don't know.

Well, can we blame them for that?

Thirty years ago, probably not. Today, I think the answer could be yes. A few days of time well spent web searching, reading and watching videos can bring someone from complete ignorance of a subject to having a very good starting point from which to grow. Today there's information on almost anything anyone might want to learn, free and widely available. What, generally speaking isn't widely present is the willingness and dedication to learn.

I have friends my age who stopped learning twenty years ago, maybe even sooner. They just don't care enough. Or maybe they thought they were safe and did not need to. In at least one case I know, that was a huge mistake. He started life as a field service engineer with great prospects. He never bothered to learn anything new. Today he sits in a trailer at an oil field 24/7 manually logging various pressures and temperatures multiple times a day.

I also blame the educational system for some of this. Maybe I was fortunate to have gone to school when I did. We started with assembler. Actually, machine language, raw 1's and 0's. By the time I learned C I had designed a few industrial control computers and fully coded them in assembler. The transition to C was very easy. And nobody had to tell me where the dangers were...because, coming from assembler, it was obvious.


That is a good point, we've never had so much information accessible. On the other hand, it's sometimes hard to know what actually matters. Maybe we (or I) don't know how to fully tap into its potential, but I still feel that I progress way faster when I can talk to someone that has experience. I can access their knowledge, but I can also start to see how they think, how they approach problems, how they solve them, what they value.

I've also heard that for them, it can be valuable to have someone with a fresh outlook on things. You notice things that people got used to, and most of the time things make perfect sense considering the situation, but sometimes there's an opportunity to improve things for the better.

You're right about learning, it's a lifelong process. I do think that doing this along other people helps. Sometimes working on something by yourself can be quite lonely, especially if the people around you are not that much into all of that. That kind of loops back into the discussion about tool. Blaming your environment is counterproductive, but it's still important to pick it carefully.


Dude, your last one is not even the correct way of calling snprintf, what the hell are you talking about?

I mean, none of these are correct. But they will all compile, often with no warnings at all. The format string doesn’t need to be a literal string: this is useful for situations like localization.

No surprise that this completely unjustified vitriolic reaction comes from someone arguing about C.

> snprintf(buf, 80, argv[1]);

yeaaa, this has much bigger problems than a null write...


So then what do you do?

> C string handling practically invites off-by-one errors and horrible security practices out-of-the-box.

At this point it is beating a dead horse - it is such a well-known fact how C strings work. And it's insanity that no one has proposed a new standard library with a better implementation for strings. Boo hoo, "old programs", blah blah...

And it's total insanity to blame a powerful language for allowing you to do almost anything in it. You don't even need to ask the committee for permission to roll your own _low level constructs_ - how insane is that? ;)

But keep spitting on what gives you freedom...


There's a framework for C now at https://vely.dev which may help with C strings safety and memory management, among other things.

I found Vely on dev.to and did a test project with it. Pretty neat and solid.

Handling strings in C was enough for me to choose C++…

Any good resource around with all the common C pitfalls and relative solutions?

Beej's guide to C programming is very helpful:

https://beej.us/guide/bgc/html/split/unicode-wide-characters...

