source: zzuf/trunk/doc/zzuf.1 @ 1867

Last change on this file since 1867 was 1865, checked in by Sam Hocevar, 16 years ago
  • Documentation updates.
File size: 18.8 KB
Line 
1.TH zzuf 1 "2006-12-22" "zzuf"
2.SH NAME
3zzuf \- multiple purpose fuzzer
4.SH SYNOPSIS
5\fBzzuf\fR [\fB\-AcdimnqSvx\fR] [\fB\-s\fR \fIseed\fR|\fB\-s\fR \fIstart:stop\fR] [\fB\-r\fR \fIratio\fR|\fB\-r\fR \fImin:max\fR]
6.br
7       [\fB\-f\fR \fIfuzzing\fR] [\fB\-D\fR \fIdelay\fR] [\fB\-F\fR \fIforks\fR] [\fB\-C\fR \fIcrashes\fR] [\fB\-B\fR \fIbytes\fR]
8.br
9       [\fB\-t\fR \fIseconds\fR] [\fB\-T\fR \fIseconds\fR] [\fB\-M\fR \fImegabytes\fR] [\fB\-b\fR \fIranges\fR] [\fB\-p\fR \fIports\fR]
10.br
11       [\fB\-P\fR \fIprotect\fR] [\fB\-R\fR \fIrefuse\fR] [\fB\-l\fR \fIlist\fR] [\fB\-I\fR \fIinclude\fR] [\fB\-E\fR \fIexclude\fR]
12.br
13       [\fIPROGRAM\fR [\fB\-\-\fR] [\fIARGS\fR]...]
14.br
15\fBzzuf \-h\fR | \fB\-\-help\fR
16.br
17\fBzzuf \-V\fR | \fB\-\-version\fR
18.SH DESCRIPTION
19.PP
20\fBzzuf\fR is a transparent application input fuzzer. It works by intercepting
21file and network operations and changing random bits in the program's input.
22\fBzzuf\fR's behaviour is deterministic, making it easy to reproduce bugs.
23.SH USAGE
24.PP
25\fBzzuf\fR will run an application specified on its command line, one or
26several times, with optional arguments, and will report the application's
27relevant behaviour on the standard error channel, eg:
28.PP
29\fB    zzuf cat /dev/zero\fR
30.PP
31If you want to specify flags for your application, put a \(oq\fB\-\-\fR\(cq
32marker before them on the command line (otherwise \fBzzuf\fR will try to
33interpret them as arguments for itself), eg:
34.PP
35\fB    zzuf \-B 1000 cat \-\- \-v /dev/zero\fR
36.PP
37When no program is specified, \fBzzuf\fR simply fuzzes the standard input, as
38if the \fBcat\fR utility had been called:
39.PP
40\fB    zzuf < /dev/zero\fR
41.SH OPTIONS
42.TP
43\fB\-A\fR, \fB\-\-autoinc\fR
44Increment random seed each time a new file is opened. This is only required
45if one instance of the application is expected to open the same file several
46times and you want to test a different seed each time.
47.TP
48\fB\-b\fR, \fB\-\-bytes\fR=\fIranges\fR
49Restrict fuzzing to bytes whose offsets in the file are within \fIranges\fR.
50
51Range values start at zero and are inclusive. Use dashes between range values
52and commas between ranges. If the right-hand part of a range is ommited, it
53means end of file. For instance, to restrict fuzzing to bytes 0, 3, 4, 5 and
54all bytes after offset 31, use \(oq\fB\-r0,3\-5,31\-\fR\(cq.
55
56This option is useful to preserve file headers or corrupt only a specific
57portion of a file.
58.TP
59\fB\-B\fR, \fB\-\-max\-bytes\fR=\fIn\fR
60Automatically terminate child processes that output more than \fIn\fR bytes
61on the standard output and standard error channels. This is useful to detect
62infinite loops. See also the \fB\-t\fR and \fB\-T\fR flags.
63.TP
64\fB\-c\fR, \fB\-\-cmdline\fR
65Only fuzz files whose name is specified in the target application's command
66line. This is mostly a shortcut to avoid specifying twice the argument:
67
68\fB    zzuf \-c cat file.txt\fR
69
70has the same effect as
71
72\fB    zzuf \-I \(aq^file\\.txt$\(aq cat file.txt\fR
73
74See the \fB\-I\fR flag for more information on restricting fuzzing to
75specific files.
76.TP
77\fB\-C\fR, \fB\-\-max\-crashes\fR=\fIn\fR
78Stop forking when at least \fIn\fR children have crashed. The default value
79is 1, meaning \fBzzuf\fR will stop as soon as one child has crashed. A process
80is considered to have crashed if any signal (such as, but not limited to,
81\fBSIGSEGV\fR) caused it to exit. If the \fB\-x\fR flag is used, this will
82also include processes that exit with a non-zero status.
83
84This option is only relevant if the \fB\-s\fR flag is used with a range
85argument.
86.TP
87\fB\-d\fR, \fB\-\-debug\fR
88Activate the display of debug messages.
89.TP
90\fB\-D\fR, \fB\-\-delay\fR=\fIdelay\fR
91Do not launch more than one process every \fIdelay\fR seconds. This option
92should be used together with \fB\-F\fR to avoid fork bombs.
93.TP
94\fB\-E\fR, \fB\-\-exclude\fR=\fIregex\fR
95Do not fuzz files whose name matches the \fIregex\fR regular expression. This
96option supersedes anything that is specified by the \fB\-I\fR flag. Use this
97for instance if you are unsure of what files your application is going to read
98and do not want it to fuzz files in the \fB/etc\fR directory.
99
100Multiple \fB\-E\fR flags can be specified, in which case files matching any one
101of the regular expressions will be ignored.
102.TP
103\fB\-f\fR, \fB\-\-fuzzing\fR=\fImode\fR
104Select how the input is fuzzed. Valid values for \fImode\fR are:
105.RS
106.TP
107\fBxor\fR
108randomly set and unset bits
109.TP
110\fBset\fR
111only set bits
112.TP
113\fBunset\fR
114only unset bits
115.RE
116.IP
117The default value for \fImode\fR is \fBxor\fR.
118.TP
119\fB\-F\fR, \fB\-\-max\-forks\fR=\fIforks\fR
120Specify the number of simultaneous children that can be run.
121
122This option is only relevant if the \fB\-s\fR flag is used with a range
123argument. See also the \fB\-D\fR flag.
124.TP
125\fB\-i\fR, \fB\-\-stdin\fR
126Fuzz the application's standard input. By default \fBzzuf\fR only fuzzes files.
127.TP
128\fB\-I\fR, \fB\-\-include\fR=\fIregex\fR
129Only fuzz files whose name matches the \fIregex\fR regular expression. Use
130this for instance if your application reads configuration files at startup
131and you only want specific files to be fuzzed.
132
133Multiple \fB\-I\fR flags can be specified, in which case files matching any one
134of the regular expressions will be fuzzed. See also the \fB\-c\fR flag.
135.TP
136\fB\-l\fR, \fB\-\-list\fR=\fIlist\fR
137Cherry-pick the list of file descriptors that get fuzzed. The Nth descriptor
138will really be fuzzed only if N is in \fIlist\fR.
139
140Values start at one and ranges are inclusive. Use dashes between values and
141commas between ranges. If the right-hand part of a range is ommited, it means
142all subsequent file descriptors. For instance, to restrict fuzzing to the
143first opened descriptor and all descriptors starting from the 10th, use
144\(oq\fB\-l1,10\-\fR\(cq.
145
146Note that this option only affects file descriptors that would otherwise be
147fuzzed. Even if 10 write-only descriptors are opened at the beginning of the
148program, only the next descriptor with a read flag will be the first one
149considered by the \fB\-l\fR flag.
150.TP
151\fB\-m\fR, \fB\-\-md5\fR
152Instead of displaying the program's \fIstandard output\fR, just print its MD5
153digest to \fBzzuf\fR's standard output. The standard error channel is left
154untouched.
155.TP
156\fB\-M\fR, \fB\-\-max\-memory\fR=\fImegabytes\fR
157Specify the maximum amount of memory, in megabytes, that children are allowed
158to allocate. This is useful to detect infinite loops that eat up a lot of
159memory. The value should set reasonably high so as not to interfer with normal
160program operation.
161
162\fBzzuf\fR uses the \fBsetrlimit\fR() call to set memory usage limitations and
163relies on the operating system's ability to enforce such limitations.
164.TP
165\fB\-n\fR, \fB\-\-network\fR
166Fuzz the application's network input. By default \fBzzuf\fR only fuzzes files.
167
168Only INET (IPv4) and INET6 (IPv6) connections are fuzzed. Other protocol
169families are not yet supported.
170.TP
171\fB\-p\fR, \fB\-\-ports\fR=\fIranges\fR
172Only fuzz network ports that are in \fIranges\fR. By default \fBzzuf\fR
173fuzzes all ports. The port considered is the listening port if the socket
174is listening and the destination port if the socket is connecting, because
175most of the time the source port cannot be predicted.
176
177Range values start at zero and are inclusive. Use dashes between range values
178and commas between ranges. If the right-hand part of a range is ommited, it
179means end of file. For instance, to restrict fuzzing to the HTTP and HTTPS
180ports and to all unprivileged ports, use \(oq\fB\-p80,443,1024\-\fR\(cq.
181
182This option requires network fuzzing to be activated using \fB\-n\fR.
183.TP
184\fB\-P\fR, \fB\-\-protect\fR=\fIlist\fR
185Protect a list of characters so that if they appear in input data that would
186normally be fuzzed, they are left unmodified instead.
187
188Characters in \fIlist\fR can be expressed verbatim or through escape sequences.
189The sequences interpreted by \fBzzuf\fR are:
190.RS
191.TP
192\fB\\n\fR
193new line
194.TP
195\fB\\r\fR
196return
197.TP
198\fB\\t\fR
199tabulation
200.TP
201\fB\\\fR\fINNN\fR
202the byte whose octal value is \fINNN\fR
203.TP
204\fB\\x\fR\fINN\fR
205the byte whose hexadecimal value is \fINN\fR
206.TP
207\fB\\\\\fR
208backslash (\(oq\\\(cq)
209.RE
210.IP
211You can use \(oq\fB\-\fR\(cq to specify ranges. For instance, to protect all
212bytes from \(oq\\001\(cq to \(oq/\(cq, use \(oq\fB\-P\ \(aq\\001\-/\(aq\fR\(cq.
213
214The statistical outcome of this option should not be overlooked: if characters
215are protected, the effect of the \(oq\fB\-r\fR\(cq flag will vary depending
216on the data being fuzzed. For instance, asking to fuzz 1% of input bits
217(\fB\-r0.01\fR) and to protect lowercase characters (\fB\-P\ a\-z\fR) will
218result in an actual average fuzzing ratio of 0.9% with truly random data,
2190.3% with random ASCII data and 0.2% with standard English text.
220
221See also the \fB\-R\fR flag.
222.TP
223\fB\-q\fR, \fB\-\-quiet\fR
224Hide the output of the fuzzed application. This is useful if the application
225is very verbose but only its exit code or signaled status is really useful to
226you.
227.TP
228\fB\-r\fR, \fB\-\-ratio\fR=\fIratio\fR
229.PD 0
230.TP
231\fB\-r\fR, \fB\-\-ratio\fR=\fImin:max\fR
232.PD
233Specify the proportion of bits that will be randomly fuzzed. A value of 0
234will not fuzz anything. A value of 0.05 will fuzz 5% of the open files'
235bits. A value of 1.0 or more will fuzz all the bytes, theoretically making
236the input files undiscernible from random data. The default fuzzing ratio
237is 0.004 (fuzz 0.4% of the files' bits).
238
239A range can also be specified. When doing so, \fBzzuf\fR will pick ratio
240values from the interval. The choice is deterministic and only depends on
241the interval bounds and the current seed.
242.TP
243\fB\-R\fR, \fB\-\-refuse\fR=\fIlist\fR
244Refuse a list of characters by not fuzzing bytes that would otherwise be
245changed to a character that is in \fIlist\fR. This does not prevent characters
246from appearing in the output if the original byte was already in \fIlist\fR.
247
248See the \fB\-P\fR option for a description of \fIlist\fR.
249.TP
250\fB\-s\fR, \fB\-\-seed\fR=\fIseed\fR
251.PD 0
252.TP
253\fB\-s\fR, \fB\-\-seed\fR=\fIstart:stop\fR
254.PD
255Specify the random seed to use for fuzzing, or a range of random seeds.
256Running \fBzzuf\fR twice with the same random seed will fuzz the files exactly
257the same way, even with a different target application. The purpose of this is
258to use simple utilities such as \fBcat\fR or \fBcp\fR to generate a file that
259causes the target application to crash.
260
261If a range is specified, \fBzzuf\fR will run the application several times,
262each time with a different seed, and report the behaviour of each run.
263.TP
264\fB\-S\fR, \fB\-\-signal\fR
265Prevent children from installing signal handlers for signals that usually
266cause coredumps. These signals are \fBSIGABRT\fR, \fBSIGFPE\fR, \fBSIGILL\fR,
267\fBSIGQUIT\fR, \fBSIGSEGV\fR, \fBSIGTRAP\fR and, if available on the running
268platform, \fBSIGSYS\fR, \fBSIGEMT\fR, \fBSIGBUS\fR, \fBSIGXCPU\fR and
269\fBSIGXFSZ\fR. Instead of calling the signal handler, the application will
270simply crash. If you do not want core dumps, you should set appropriate limits
271with the \fBlimit coredumpsize\fR command. See your shell's documentation on
272how to set such limits.
273.TP
274\fB\-t\fR, \fB\-\-max\-time\fR=\fIn\fR
275Automatically terminate child processes that run for more than \fIn\fR
276seconds. This is useful to detect infinite loops or processes stuck in other
277situations. See also the \fB\-B\fR and \fB\-T\fR flags.
278.TP
279\fB\-T\fR, \fB\-\-max\-cpu\fR=\fIn\fR
280Automatically terminate child processes that use more than \fIn\fR seconds
281of CPU time.
282
283\fBzzuf\fR uses the \fBsetrlimit\fR() call to set CPU usage limitations and
284relies on the operating system's ability to enforce such limitations. If the
285system sends \fBSIGXCPU\fR signals and the application catches that signal,
286it will receive a \fBSIGKILL\fR signal after 5 seconds.
287
288This is more accurate than \fB\-t\fR because the behaviour should be
289independent from the system load, but it does not detect processes stuck into
290infinite \fBselect\fR() calls because they use very little CPU time. See also
291the \fB\-B\fR and \fB\-t\fR flags.
292.TP
293\fB\-v\fR, \fB\-\-verbose\fR
294Print information during the run, such as the current seed, what processes
295get run, their exit status, etc.
296.TP
297\fB\-x\fR, \fB\-\-check\-exit\fR
298Report processes that exit with a non-zero status. By default only processes
299that crash due to a signal are reported.
300.TP
301\fB\-h\fR, \fB\-\-help\fR
302Display a short help message and exit.
303.TP
304\fB\-V\fR, \fB\-\-version\fR
305Output version information and exit.
306.SH DIAGNOSTICS
307.PP
308Exit status is zero if no child process crashed. If one or several children
309crashed, \fBzzuf\fR exits with status 1.
310.SH EXAMPLES
311.PP
312Fuzz the input of the \fBcat\fR program using default settings:
313.PP
314\fB    zzuf cat /etc/motd\fR
315.PP
316Fuzz 1% of the input bits of the \fBcat\fR program using seed 94324:
317.PP
318\fB    zzuf \-s94324 \-r0.01 cat /etc/motd\fR
319.PP
320Fuzz the input of the \fBcat\fR program but do not fuzz newline characters
321and prevent non-ASCII characters from appearing in the output:
322.PP
323\fB    zzuf \-P \(aq\\n\(aq \-R \(aq\\x00\-\\x1f\\x7f\-\\xff\(aq cat /etc/motd\fR
324.PP
325Fuzz the input of the \fBconvert\fR program, using file \fBfoo.jpeg\fR as the
326original input and excluding \fB.xml\fR files from fuzzing (because
327\fBconvert\fR will also open its own XML configuration files and we do not
328want \fBzzuf\fR to fuzz them):
329.PP
330\fB    zzuf \-E \(aq\\.xml$\(aq convert \-\- foo.jpeg \-format tga /dev/null\fR
331.PP
332Fuzz the input of VLC, using file \fBmovie.avi\fR as the original input
333and restricting fuzzing to filenames that appear on the command line
334(\fB\-c\fR), then generate \fBfuzzy\-movie.avi\fR which is a file that
335can be read by VLC to reproduce the same behaviour without using
336\fBzzuf\fR:
337.PP
338\fB    zzuf \-c \-s87423 \-r0.01 vlc movie.avi\fR
339.br
340\fB    zzuf \-c \-s87423 \-r0.01 <movie.avi >fuzzy\-movie.avi\fR
341.br
342\fB    vlc fuzzy\-movie.avi\fR
343.PP
344Fuzz between 0.1% and 2% of MPlayer's input bits (\fB\-r0.001:0.02\fR)
345with seeds 0 to 9999 (\fB\-s0:10000\fR), preserving the AVI 4-byte header
346by restricting fuzzing to offsets after 4 (\fB\-b4\-\fR), disabling its
347standard output messages (\fB\-q\fR), launching up to five simultaneous child
348processes (\fB\-F5\fR) but waiting at least half a second between launches
349(\fB\-D0.5\fR), killing MPlayer if it takes more than one minute to
350read the file (\fB\-T60\fR) and disabling its \fBSIGSEGV\fR signal handler
351(\fB\-S\fR):
352.PP
353\fB    zzuf \-c \-r0.001:0.02 \-s0:10000 \-b4\- \-q \-F5 \-D0.5 \-T60 \-S \\\fR
354.br
355\fB      mplayer \-\- \-benchmark \-vo null \-fps 1000 movie.avi\fR
356.PP
357Create an HTML-like file that loads 200 times the same \fBhello.jpg\fR image
358and open it in Firefox\(tm in auto-increment mode (\fB\-A\fR):
359.PP
360\fB    seq \-f \(aq<img src="hello.jpg#%g">\(aq 1 200 > hello.html\fR
361.br
362      (or: \fBjot \-w \(aq<img src="hello.jpg#%d">\(aq 200 1 > hello.html\fR)
363.br
364\fB    zzuf \-A \-I \(aqhello[.]jpg\(aq \-r0.001 firefox hello.html\fR
365.PP
366Run a simple HTTP redirector on the local host using \fBsocat\fR and
367corrupt each network connection (\fB\-n\fR) in a different way (\fB\-A\fR)
368after one megabyte of data was received on it (\fB\-b1000000\-\fR):
369.PP
370\fB     zzuf \-n \-A \-b1000000\- \\\fR
371\fB       socat TCP4\-LISTEN:8080,reuseaddr,fork TCP4:192.168.1.42:80\fR
372.PP
373Browse the intarweb (\fB\-n\fR) using Firefox\(tm without fuzzing local files
374(\fB\-E.\fR) or non-HTTP connections (\fB\-p80,8010,8080\fR), preserving
375the beginning of the data sent with each HTTP response (\fB\-b4000\-\fR)
376and using another seed on each connection (\fB\-A\fR):
377.PP
378\fB    zzuf \-r 0.0001 \-n \-E. \-p80,8010,8080 \-b4000\- \-A firefox\fR
379.SH RESTRICTIONS
380.PP
381Due to \fBzzuf\fR using shared object preloading (\fBLD_PRELOAD\fR,
382\fB_RLD_LIST\fB, \fBDYLD_INSERT_LIBRARIES\fR, etc.) to run its child
383processes, it will fail in the presence of any mechanism that disables
384preloading. For instance setuid root binaries will not be fuzzed when run
385as an unprivileged user.
386.PP
387For the same reasons, \fBzzuf\fR will also not work with statically linked
388binaries. Bear this in mind when using \fBzzuf\fR on the OpenBSD platform,
389where \fBcat\fR, \fBcp\fR and \fBdd\fR are static binaries.
390.PP
391Though best efforts are made, identical behaviour for different versions of
392\fBzzuf\fR is not guaranteed. The reproducibility for subsequent calls on
393different operating systems and with different target programs is only
394guaranteed when the same version of \fBzzuf\fR is being used.
395.SH BUGS
396.PP
397\fBzzuf\fR probably does not behave correctly with 64-bit offsets.
398.PP
399It is not yet possible to insert or drop bytes from the input, to fuzz
400according to the file format, to swap bytes, etc. More advanced fuzzing
401methods are planned.
402.PP
403As of now, \fBzzuf\fR does not really support multithreaded applications. The
404behaviour with multithreaded applications where more than one thread does file
405descriptor operations is undefined.
406.SH NOTES
407In order to intercept file and network operations, signal handlers and memory
408allocations, \fBzzuf\fR diverts and reimplements the following functions,
409which can be private libc symbols, too:
410.TP
411Unix file descriptor handling:
412\fBopen\fR(), \fBlseek\fR(), \fBread\fR(), \fBreadv\fR(), \fBpread\fR(),
413\fBaccept\fR(), \fBsocket\fR(), \fBrecv\fR(), \fBrecvfrom\fR(), \fBrecvmsg\fR(),
414\fBaio_read\fR(), \fBaio_return\fR(), \fBclose\fR()
415.TP
416Standard IO streams:
417\fBfopen\fR(), \fBfreopen\fR(), \fBfseek\fR(), \fBfseeko\fR(), \fBrewind\fR(),
418\fBfread\fR(), \fBgetc\fR(), \fBfgetc\fR(), \fBfgets\fR(), \fBungetc\fR(),
419\fBfclose\fR()
420.TP
421Memory management:
422\fBmmap\fR(), \fBmunmap\fR(), \fBmalloc\fR(), \fBcalloc\fR(), \fBvalloc\fR(),
423\fBfree\fR(), \fBmemalign\fR(), \fBposix_memalign\fR()
424.TP
425Linux-specific:
426\fBopen64\fR(), \fBlseek64\fR(), \fBmmap64\fR(), \fB_IO_getc\fR(),
427\fBgetline\fR(), \fBgetdelim\fR(), \fB__getdelim\fR()
428.TP
429BSD-specific:
430\fBfgetln\fR(), \fB__srefill\fR()
431.TP
432Mac OS X-specific:
433\fBmap_fd\fR()
434.TP
435Signal handling:
436\fBsignal\fR(), \fBsigaction\fR()
437.PP
438If an application manipulates file descriptors (reading data, seeking around)
439using functions that are not in that list, \fBzzuf\fR will not fuzz its
440input consistently and the results should not be trusted. You can use a tool
441such as \fBltrace(1)\fR on Linux to know the missing functions.
442.PP
443On BSD systems, such as FreeBSD or Mac OS X, \fB__srefill\fR() is enough to
444monitor all standard IO streams functions. On other systems, such as Linux,
445each function is reimplemented on a case by case basis. One important
446unimplemented function is \fBfscanf\fR(), because of its complexity. Missing
447functions will be added upon user request.
448.SH HISTORY
449.PP
450\fBzzuf\fR started its life in 2002 as the \fBstreamfucker\fR tool, a small
451multimedia stream corrupter used to find bugs in the VLC media player.
452.SH AUTHOR
453.PP
454Copyright \(co 2002, 2007 Sam Hocevar <sam@zoy.org>.
455.PP
456\fBzzuf\fR and this manual page are free software. They come without any
457warranty, to the extent permitted by applicable law. You can redistribute
458them and/or modify them under the terms of the Do What The Fuck You Want
459To Public License, Version 2, as published by Sam Hocevar. See
460\fBhttp://sam.zoy.org/wtfpl/COPYING\fR for more details.
461.PP
462\fBzzuf\fR's webpage can be found at \fBhttp://sam.zoy.org/zzuf/\fR.
Note: See TracBrowser for help on using the repository browser.