Author:
- Name: Jyrki Holopainen
Location: FI - Republic of Finland (Finland)
To build:
If you have gcc < 2.3.3 you can build this entry like:
make all
This will try to create a file whose actual filename is literally:
";main(){puts("Hello World!");}char*C=".c
When that file is compiled, it will greet the world in the traditional fashion of C.
For an explanation of why and how this works see the author’s remarks. The gist of the entry itself, however, is the file name itself is part of the code.
NOTE: We delete the temporary file as part of make all
whether
or not the compilation succeeds. This is a Makefile feature
that might bug anyone running the make command. :-)
This entry will not compile with gcc version >= 2.3.3 as it relied on a bug which was later fixed in gcc 2.3.3, and that happened a long time ago (.. in a galaxy far, far away?).
If make all
fails to compile, a suggestion of compiling and running
the alternate version is printed. This alternate version is a variant
for those of us with gcc >= 2.3.3 or using clang. See Alternate
code section below for more details.
Bugs and (Mis)features:
The current status of this entry is:
STATUS: INABIAF - please DO NOT fix
For more detailed information see 1993/lmfjyh in bugs.html.
To use:
If you have gcc < 2.3.3 (i.e. the entry can compile):
./lmfjyh
Alternate code:
This version does what the code did but in a way that will work with modern compilers. We’d like to say there’s something special about it but there isn’t.
Alternate build:
Should you wish to build this version manually:
make alt
Alternate use:
Use lmfjyh.alt
as you would lmfjyh
above. Note that other code could also be
done with the bug that the entry abuses but that would have to be manually
programmed for this version; see the author’s remarks for more details.
Judges’ remarks:
There’s more than one way to get around disk quotas. :-)
Author’s remarks:
“Practical and efficient method of compressing source code”.
Introduction
I will shortly present a totally new compression method. It may be used to compress any types of files, but it works best with C-source code.
As an example, I have taken a simple “Hello world!” program that should be familiar to all C-programmers.
Listing 1: A simple “Hello World” program
main()
{
puts("Hello World!");
}
Packing method
Normally a file name is used only to identify the file, but this new revolutionary method introduces a totally new concept: THE FILE NAME IS THE PROGRAM. There is no need to waste valuable disk space to store source code. The program is embedded in the file name, only a minor portion of it is inside the file.
Listing 2: Compressed “Hello world”
char*_=__FILE__;
Listing 3: Code embedded in the file name
";main(){puts("Hello World!");}char*C=".c
The sample program in listing 1 is 36 bytes long. By using the new compression method, the example program may be squeezed to sixteen bytes. In the example the compression ratio is 2:1. It is evident that the longer the program is, the better the compression ratio is. The maximum compression ratio in a UNIX environment is 254:16 which equals approximately to 16:1. By using the method a one megabyte of source code file can be compressed to 64 kilobytes.
One might easily think that the program could be compressed to a
single byte by writing a one character long preprocessor directive
in the source file and defining the actual source code in the
command line, e.g. cc hello.c -DX='main(){puts ...'
. With this
method nothing is gained, since the compiling commands must be
stored in a file, and that takes even more space than writing the
code in a source file as presented in listing 1.
Packing files in a UNIX environment
In a UNIX environment it is possible to write almost any program by
including the code in file names. Since /
is used to separate
path components and cannot therefore exist in a file name, all
features of the C language cannot be used.
In C, the /
is used as a division operator and it may also be a
part of a comment start and end identifier. This, however, is not a
dangerous limitation, since practice has shown that comments are
almost totally unnecessary. A real programmer can more easily
understand pure source code, and misleading comments are
disastrous. The division operation may be replaced with pow(x, -1)
and multiplication when using real numbers or with div()
function
when using integers.
Large programs must be separated into small modules. This enforces people to write better programs, since monolithic programming style cannot be used. Program development will also be faster, no extra code needs be recompiled after changes have been made.
Usage with currently available tools
I have tested how well this new method works with currently available programming tools.
Any ANSI conforming C-compiler that runs under a REAL operating system should be able to compile the code (there might be problems with some older UNIX variants).
Compilers
These worked:
- gcc versions 2.0, 2.1, 2.3.2 (ld failed at AIX with gcc 2.1)
- cc (HP-UX)
- cc (VAX)
These failed miserably:
- cc (SunOS/Solaris) - could not start cpp properly
- cc/xlc (AIX) - could not open source
Other tools
Debuggers: (tested at SunOS 4.1.2)
dbx
- Quit with a very ‘informative’ error message:"dbx: fatal error: Is a directory"
gdb
- Printed almost the same error message": Is a directory"
. It was, however, possible to step through the code but not list it.
Lints
None of the lints tested environments were able to parse the file.
Conclusions
The method can be used to compress any type of file and the compression ratios are even better when compressing files other than C source. For example plain text files may be compressed with INFINITE compression ratio. It is trivial task to compress a 100 kilobytes long text file in about 400 files whose lengths are zero bytes. The drawback is that the files must be unpacked before they can be used, so a large amount of temporary disk space is needed, while C source is totally usable in the compressed form.
Evidently, there is a long way to go before this new programming technique may be widely used, since currently there are only few tools supporting it. The advantages over the traditional methods are, however, so great that there is no doubt that this method will be new standard.
Inventory for 1993/lmfjyh
Primary files
- lmfjyh.c - entry source code
- Makefile - entry Makefile
- lmfjyh.alt.c - alternate source code
- lmfjyh.orig.c - original source code
Secondary files
- 1993_lmfjyh.tar.bz2 - download entry tarball
- README.md - markdown source for this web page
- .entry.json - entry summary and manifest in JSON
- .gitignore - list of files that should not be committed under git
- .path - directory path from top level directory
- index.html - this web page