Reverse engineering

The export directory

Dejan Lukan
April 18, 2013 by
Dejan Lukan

Example of the export directory

Let's take a look at a simple example to understand how the export directory is used by the executables/libraries. Let's suppose that we're dealing with a .dll library that has 10 exported functions, so the NumberOfFunctions=10. It has 5 names, so the NumberOfNames=5. Because it has 5 names, the number of elements in the AddressOfNames and AddressOfNameOrdinals is also 5. The picture below shows this example:

In the picture above, we can see that the AddressOfFunctions array holds the addresses of all 10 exported functions. The addresses are denoted with addr0...addr9. The AddressOfNames contains 5 names that correspond to certain addresses. Remember that there is no indication which name links to which address in the AddressOfNames array. We showed the arrows just so we can better illustrate the example. The linkage between the names and their appropriate addresses is written in the AddressOfNameOrdinals array. Each element of that array holds an index of the corresponding elements from the AddressOfNames array to the element in AddressOfFunctions array. The name4 function is the fifth name in the AddressOfNames array, which is why the fifth element in the AddressOfNameOrdinals array will hold the index of the corresponding address of that function in the AddressOfFunctions array.

There are two scenarios we need to go over. The first one is when the function is exported by name and we must find its address. The second one is when the function is exported by ordinal, and we must also find the function's address. When we want to figure out the address of the function exported by name, we have to traverse the AddressOfNames and AddressOfNameOrdinals at the same time. When we find a matching name in the AddressOfNames array, we must take the number at the same index from the AddressOfNameOrdinals array. The extracted number is the index into the AddressOfFunctions array where we can get the RVA of the associated function. When the function is exported by an ordinal, we can directly use the ordinal number as an index into the AddressOfFunctions array if the Base number is 0. In our case, the Base number is 1, which means that we have to take the ordinal number and subtract the Base number (which is 1 in this case) from it. This makes a lot of sense, because the ordinal numbers start counting from 1 onwards, but in C arrays we're always starting to count with 0. If we start the PE Explorer tool and open the kernel32.dll, we can see that the ordinal numbers start with 1 and not 0:

We can see that when we're using the ordinals, obtaining the address of the function is much faster because we only have to calculate one subtract operation, while with names we have to traverse and compare each and every name in the AddressOfNames array. But by using the ordinals, the whole process is not very compatible over the systems, because the ordinal numbers change if we update or edit the library; the libraries don't get updated so often, but it does happen when the libraries must be updated because of the bug or because the developer's added new features. In this case, if ordinals were used, we would effectively be calling the wrong function.

Complex example of export directory

Let's dump the file headers of the kernel32.dll library. This can be seen in the output below:

[plain]

0:002> !dh 7c800000 -f

File Type: DLL

FILE HEADER VALUES

14C machine (i386)

4 number of sections

506BC5E5 time date stamp Wed Oct 03 06:58:13 2012

0 file pointer to symbol table

0 number of symbols

E0 size of optional header

210E characteristics

Executable

Line numbers stripped

Symbols stripped

32 bit word machine

DLL

OPTIONAL HEADER VALUES

10B magic #

7.10 linker version

83400 size of code

70400 size of initialized data

0 size of uninitialized data

B64E address of entry point

1000 base of code

----- new -----

7c800000 image base

1000 section alignment

200 file alignment

3 subsystem (Windows CUI)

5.01 operating system version

5.01 image version

4.00 subsystem version

F6000 size of image

400 size of headers

FBCBC checksum

00040000 size of stack reserve

00001000 size of stack commit

00100000 size of heap reserve

00001000 size of heap commit

0 DLL characteristics

262C [ 6D19] address [size] of Export Directory

8190C [ 28] address [size] of Import Directory

8A000 [ 65EE8] address [size] of Resource Directory

0 [ 0] address [size] of Exception Directory

0 [ 0] address [size] of Security Directory

F0000 [ 5C8C] address [size] of Base Relocation Directory

841FC [ 38] address [size] of Debug Directory

0 [ 0] address [size] of Description Directory

0 [ 0] address [size] of Special Directory

0 [ 0] address [size] of Thread Storage Directory

4E698 [ 40] address [size] of Load Configuration Directory

0 [ 0] address [size] of Bound Import Directory

1000 [ 624] address [size] of Import Address Table Directory

0 [ 0] address [size] of Delay Import Directory

0 [ 0] address [size] of COR20 Header Directory

0 [ 0] address [size] of Reserved Directory

[/plain]

This time, we're interested in the export directory that starts at RVA 0x262C, which means that the actual address is 0x7c800000+0x262C = 0x7c80262c. We also know that export directory uses the _IMAGE_EXPORT_DIRECTORY structure to present the data. The picture below dumps the memory where the export table is located:

We used the _IMAGE_EXPORT_DIRECTORY structure to dump the memory, which contains a number of fields seen above. The structure is contained in the winnt.h header file and has the following definition:

We can see that we've actually printed those exact elements on the previous picture. Let's now explain the important elements of the IMAGE_EXPORT_DIRECTORY structure:

  • Characteristics [32 bits]: unused.
  • TimeDateStamp [32 bits]: the time the table was created (some linkers set it to 0)
  • MajorVersion [16 bits]: often 0
  • MinorVersion [16 bits]: often 0
  • Name [32 bits]: the name of the DLL
  • Base [32 bits]: the number used to subtract from the ordinal number to get the index into the AddressOfFunctions array.
  • NumberOfFunctions [32 bits]: total number of exported functions, either by name or ordinal.
  • NumberOfNames [32 bits]: number of exported names, which need not be the NumberOfFunctions that presents all of the functions exported by module. Rather than that, this field presents only the number of functions exported by name. Functions can also be exported by ordinal, rather than name. If this value is 0, then all of the functions in this module are exported by ordinal and none of them is exported by name.
  • AddressOfFunctions [32 bits]: a RVA to the list of exported functions - it points to an array of NumberOfFunctions 32-bit values, each being a RVA to the exported function or variable.
  • AddressOfNames [32 bits]: a RVA to the list of exported names - it points to an array of NumberOfNames 32-bit values, each being a RVA to the exported symbol name.
  • AddressOfNameOrdinals [16 bits]: a RVA to the list of ordinals - it points to an array of NumberOfNames 16-bit values, each being an ordinal.

We need to understand that each library needs to keep the information about the imported/exported functions somewhere in PE header. How else would the operating system find it when needed?

The Name file of the IMAGE_EXPORT_DIRECTORY structure holds the RVA address to the name of the library we're currently analyzing. In our case, the RVA address is 0x4b98, which means the whole address where the name of the current library is saved is 0x7c800000+0x4b98. We can print the string located at that address with the da command as shown on the picture below:

We can see that we're currently analyzing kernel32.dll library, which is correct. Let's now dump NumberOfFunctions and NumberOfNames that present the total number of exported functions and the number of exported functions by name. In our case, both numbers are 0x3ba, which means that all of the exported functions in kernel32.dll are exported by name, and none of them is exported by the original. The 0x3ba hexadecimal number is 954 in decimal, which means that kernel32.dll exports 954 function names in total.

Let's now dump the first eight elements of AddressOfFunctions array. Since the RVA of AddressOfFunctions element is 0x2654, we must dump the memory at address 0x7c800000+0x2654. On the picture below, we used the dd command to dump the memory at the virtual address of the AddressOfFunctions array.

We dumped only the first eight addresses. To get the real virtual addresses, we must add 0x7c800000 to the obtained RVAs: 0xa6e4, 0x145cd, …, 0x2d639. The first eight exported functions of the kernel32.dll library use the following addresses:

Function Address Function Name

0x7c80a6e4 ?

0x7c8145cd ?

0x7c8330c1 ?

0x7c871e77 ?

0x7c871e39 ?

0x7c85948a ?

0x7c85936e ?

0x7c82d639 ?

Let's also dump the first eight elements of the AddressOfNames structure. Those elements can be seen on the picture below:

We can simply dump the appropriate names from the newly obtained RVAs with the da command that prints an ASCII string from the given virtual address until null bytes is detected.

The new table with both the function addresses and function names is presented below. Actually, we currently don't know whether all of the addresses actually match the given name, because it's the job of the AddressOfNameOrdinals to determine which function name belongs to which address. Later in the article, we'll see that subsequent indexes are used, so the names and addresses in the table below match.

Function Address Function Name

0x7c80a6e4 ActivateActCtx

0x7c8145cd AddAtomA

0x7c8330c1 AddAtomW

0x7c871e77 AddConsoleAliasA

0x7c871e39 AddConsoleAliasW

0x7c85948a AddLocalAlternateComputerNameA

0x7c85936e AddLocalAlternateComputerNameW

0x7c82d639 AddRefActCtx

We can check very simply whether the addresses are right. We can do this with the u command. On the picture below, we can see three addresses, the first, the second, and the last, which match the table above.

After that, there's only the AddressOfNameOrdinals array left to analyze. So far, we've mentioned that the first element of the AddressOfNameOrdinals should contain the index 0, because the first element in AddressOfNames array points to the first element in the AddressOfFunctions array. The second element in AddressOfNames points to the second element in the AddressOfFunctions, etc. To verify this, we can simply dump the AddressOfNameOrdinals array as can be seen below:

Notice that we used the dw command and not the dd command? This is because the values in the AddressOfNameOridinals are 16-bit numbers and not 32-bit numbers, which is why we must treat them like that.

Conclusion

In this article, we've presented the export directory structure as used by the PE header to export functions of the library. At first, we presented a theory on a simple example of 10 exported functions where 5 of them were exported by name and the other 5 were exported by original. After that, we presented a real example on kernel32.dll, where all functions were exported by name. We traversed all the arrays AddressOfFunctions, AddressOfNames and AddressOfNameOridinals and proven that the presented theory holds and is correct.

Become a certified reverse engineer!

Become a certified reverse engineer!

Get live, hands-on malware analysis training from anywhere, and become a Certified Reverse Engineering Analyst.

Sources

Dejan Lukan
Dejan Lukan

Dejan Lukan is a security researcher for InfoSec Institute and penetration tester from Slovenia. He is very interested in finding new bugs in real world software products with source code analysis, fuzzing and reverse engineering. He also has a great passion for developing his own simple scripts for security related problems and learning about new hacking techniques. He knows a great deal about programming languages, as he can write in couple of dozen of them. His passion is also Antivirus bypassing techniques, malware research and operating systems, mainly Linux, Windows and BSD. He also has his own blog available here: http://www.proteansec.com/.