Author:
- Name: Szabolcs Borsanyi
Location: DE - Federal Republic of Germany (Germany)
To build:
make
Bugs and (Mis)features:
The current status of this entry is:
STATUS: INABIAF - please DO NOT fix
For more detailed information see 2011/borsanyi in bugs.html.
To use:
./borsanyi < some_data_file
Try:
./try.sh
Judges’ remarks:
This is a nice quick histogram plotter. A useful utility when you need to look at the distribution of some data.
Challenges:
You can modify the number of histogram bins and the scaling of the histogram by editing the source. Try to find them!
If you look at the source you will notice that the usual array of
histogram bins is missing. The source contains no arrays and it does not
use malloc(3)
or alloca(3)
. How does it build the histogram?
While the input does not need to be sorted, the histogram output is sorted.
How does it put the data into buckets? Why does it fork(2)
22 times?
Author’s remarks:
This small piece of program is meant to be a useful tool, when there are some data around and you need a quick glance at the histogram. I am all too often logged in to a remote machine with slow X connection, and a terminal-based program would be of great help.
Limitations
- I optimized the output for the data I had around, you might want to adjust for your needs. For a great excess of data points the program will crash (depending on the available stack).
- Giving too few (distinct) points can also cause crash (or division by zero).
- The program expects plain numbers separated by white spaces.
- ASCII is assumed.
- Some POSIX-like system is assumed (
fork()
andwait(NULL)
are used). - Do not give any arguments to the program.
- Could rounding errors cause the omission of the highest bin? There might be empty bins at the edges.
Obfuscation
- The program follows a rather unconventional approach to the task of histogram building. The usual array of integers (for the histogram bins) is missing entirely. The data are read once, the range is determined, but the data points are not put in any array. Yet they are remembered, and each of them is put into its own bin afterwards. But where are the bins?
- I tried to use as few keywords as possible, hence there are no loops.
- Multi-process approach for optimal use of resources.
- Some bin size obfuscation.
Inventory for 2011/borsanyi
Primary files
- borsanyi.c - entry source code
- Makefile - entry Makefile
- borsanyi.orig.c - original source code
- try.sh - script to try entry
- data.1 - sample input
- data.2 - sample input
- data.3 - sample input
Secondary files
- 2011_borsanyi.tar.bz2 - download entry tarball
- README.md - markdown source for this web page
- .entry.json - entry summary and manifest in JSON
- .gitignore - list of files that should not be committed under git
- .path - directory path from top level directory
- index.html - this web page