【问题标题】:Heap buffer overflow on sprintfsprintf 上的堆缓冲区溢出
【发布时间】:2020-04-27 08:10:09
【问题描述】:

我在此代码上收到 heap-buffer-overflow 错误:

// ast.c
char *not_last_prefix = malloc(strlen(next_prefix) + 4); // line 204

sprintf(not_last_prefix, "%s│  ", next_prefix); // line 206
=================================================================
==3394==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x602000000279 at pc 0x7f0d9e6d7715 bp 0x7fff975bcf60 sp 0x7fff975bc6f0
WRITE of size 11 at 0x602000000279 thread T0
    #0 0x7f0d9e6d7714 in vsprintf (/lib/x86_64-linux-gnu/libasan.so.5+0x9e714)
    #1 0x7f0d9e6d7bce in sprintf (/lib/x86_64-linux-gnu/libasan.so.5+0x9ebce)
    #2 0x55708e40b909 in print_ast_impl src/ast.c:206
    #3 0x55708e40b7ef in print_ast src/ast.c:192
    #4 0x55708e4112ad in main src/main.c:50
    #5 0x7f0d9e46f1e2 in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x271e2)
    #6 0x55708e40a5cd in _start (/home/michael/Code/Baby-C/debug/bcc+0x65cd)

0x602000000279 is located 0 bytes to the right of 9-byte region [0x602000000270,0x602000000279)
allocated by thread T0 here:
    #0 0x7f0d9e746ae8 in malloc (/lib/x86_64-linux-gnu/libasan.so.5+0x10dae8)
    #1 0x55708e40b8cd in print_ast_impl src/ast.c:204
    #2 0x55708e40b7ef in print_ast src/ast.c:192
    #3 0x55708e4112ad in main src/main.c:50
    #4 0x7f0d9e46f1e2 in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x271e2)

SUMMARY: AddressSanitizer: heap-buffer-overflow (/lib/x86_64-linux-gnu/libasan.so.5+0x9e714) in vsprintf
Shadow bytes around the buggy address:
  0x0c047fff7ff0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
  0x0c047fff8000: fa fa 00 fa fa fa 02 fa fa fa 00 00 fa fa 00 00
  0x0c047fff8010: fa fa 02 fa fa fa 00 00 fa fa 00 00 fa fa 02 fa
  0x0c047fff8020: fa fa 00 00 fa fa 00 00 fa fa 02 fa fa fa 02 fa
  0x0c047fff8030: fa fa 02 fa fa fa 02 fa fa fa 02 fa fa fa 02 fa
=>0x0c047fff8040: fa fa 02 fa fa fa fd fa fa fa 00 01 fa fa 00[01]
  0x0c047fff8050: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
  0x0c047fff8060: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
  0x0c047fff8070: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
  0x0c047fff8080: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
  0x0c047fff8090: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa
Shadow byte legend (one shadow byte represents 8 application bytes):
  Addressable:           00
  Partially addressable: 01 02 03 04 05 06 07 
  Heap left redzone:       fa
  Freed heap region:       fd
  Stack left redzone:      f1
  Stack mid redzone:       f2
  Stack right redzone:     f3
  Stack after return:      f5
  Stack use after scope:   f8
  Global redzone:          f9
  Global init order:       f6
  Poisoned by user:        f7
  Container overflow:      fc
  Array cookie:            ac
  Intra object redzone:    bb
  ASan internal:           fe
  Left alloca redzone:     ca
  Right alloca redzone:    cb
  Shadow gap:              cc
==3394==ABORTING

我能找到的一切都表明我没有为sprintf 的结果分配足够的空间,但我看不出这是怎么回事。我为next_prefix 的长度分配空间,为它后面的"│ " 分配3 个字节,为NULL 终止符分配1 个字节。结果字符串应该适合。我在这里错过了什么?

【问题讨论】:

  • 天哪,你是对的。该字符串中有 is 一些 unicode 废话,这不是正常的竖线。我觉得很愚蠢,我认为这一定是问题所在。
  • 这么简单的格式化,为什么不用strncpy()strncat()呢?不过,它也会有同样的问题。
  • @thebusybee 我考虑过,但sprintf 似乎有更清晰的语法 - 这样做有什么缺点吗?
  • snprintf() 具有 聪明的 语法和防止缓冲区溢出的保险。
  • @wildplasser 我也考虑过snprintf - 这可以防止缓冲区溢出,但产生的错误实际上更难捕获,因为它会导致在不正确的行为中没有错误。

标签: c


【解决方案1】:

问题是字符串文字的长度不是3,而是5。这是因为竖线不是标准的ASCII字符,而是unicode字符(UTF-8编码为三个字节) .

为避免此类问题,应将文字分配给 char * 并取其长度,如下所示

char *separator = "│  ";
char *not_last_prefix = malloc(strlen(next_prefix) + strlen(separator) + 1);
sprintf(not_last_prefix, "%s%s", next_prefix, separator); 

【讨论】:

    【解决方案2】:

    正如向我指出的那样,问题是我的格式字符串包含一个 unicode 字符。我错误地认为再分配一个字节就可以解决问题 - 结果是 UTF-8 字符可以长达 4 个字节!好消息是,您可以通过查看这个简单的表格 (found here) 来准确检查它们占用了多少字节。

    Character code (decimal) | Bytes used
    -------------------------|------------
    0-127                    | 1 byte
    128-2047                 | 2 bytes
    2048-65535               | 3 bytes
    65536-1114111            | 4 bytes
    

    在我的例子中,我使用的竖线字符 () 是 unicode "\u2502",这意味着它占用了 3 个字节!

    【讨论】:

    • 显然 StackOverflow 不支持 Markdown 表??
    猜你喜欢
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 2010-11-11
    • 2017-02-23
    • 2021-11-02
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    相关资源
    最近更新 更多