MDT (Gotcha Force): Difference between revisions

← Older edit

VisualWikitext

Latest revision as of 14:41, 7 October 2023

← Gotcha Force

This article is about Gotcha Force MDT file format and ongoing researches on it.

This file format needs more research.
Researches on headers / bodies structures are partially achieved.

mdt files:

MDT files are same than pzz but packing only uncompressed files.

MDT header (0x800 bytes):

4 bytes - uint32 file count (big endian)
4 bytes - uint32 array: file size divided by 0x800 and upper rounded for each file

All .mdt contains 2 files:

000 dat file with texts used in the game
001 TPL including an alphanumeric alphabet used by the dat file

dat file:

header:

4 bytes - paragraph_offsets_blocks_list offset
4 bytes - symbols_count # number of symbols contained in the tpl
2 bytes - universal symbols ID shared between all .mdt and positional from tpl file

paragraph_offsets_blocks_list:

4 bytes - uint32 absolute offsets - array of paragraph_offsets_block terminated by -1 (FF FF FF FF)

paragraph_offsets_block 0:

first paragraph_offsets_block describing all paragraph offsets in the data block 0

4 bytes - int32 array of absolute offsets which point to pragraphs in the data block terminated by -1 (FF FF FF FF). The first offset is the data block offset.

data_block 0:

2 bytes - signed int16 list <- paragraph_offsets_block 0 describe where each paragraph begins.

paragraph_offsets_block 1: ... data_block 1: ...

align:

The header length is aligned to 32 bytes and if there is no pad we add 32 bytes of pad. paragraphs_offsets_block are aligned to 16 bytes. Each paragraph is aligned to 32 bytes.

Text engine NTSC/USA

data_block contains int16 indexes relative to the tpl symbol list. First symbol in the tpl has number 0 then horizontally symbol 1, 2 ... So data block(s) contains texts.

FFyy: <- each negative value describes something not present in the tpl
- FFFE: Appends a whitespace character to the text.
- FFFD: Appends a small whitespace character to the text.
10yy:
- 1000: Appends a newline character to the text.
- 1001: Marks the end of the text. All text must end with this.
80yy:
- 8000: Appends the Player's name to the text.
- 8001: Appends an unidentified string to the text.
- 8002 (1 param): Applies a color preset from the param index to the text. An 0xFFFF param disables it.
- 8003 (1 param): Appends an unidentified string from the param index to the text.

The text engine uses a hash table corresponding to symbols and allow to color the textures or use vars like player name. This hashtable is also used in the dol data sections for instance to encode texts.

Each paragraph begins with:

2 bytes - total_paragraph_len - starting after the paragraph header and uint16 count
2 bytes - total_lines_count
2 bytes - max_lines_len - including the last \x10\x01

Here is the hashtable:

Virtual World RE has developed the python script mdttool.py to manipulate MDT files and their internal files allowing to unpack / pack mdt files.

@@ Line 1: / Line 1: @@
 [[Gotcha Force | &larr; Gotcha Force]]
-<div style="text-align: center;">
+''This article is about Gotcha Force MDT file format and ongoing researches on it.''
-<div style="color: rgb(241, 196, 15);">This section is currently being written.</div>
-<div style="color: rgb(241, 196, 15); text-align: center;">More research is needed and some paragraphs may be wrong.</div>
-</div>
-MDT files are similar to PZZ files, with a header of 0x800 / 2048 bytes including :
+{{Research | 2| Researches on headers / bodies structures are partially achieved. }}
-* the number of files (uint32 big endian)
-* the file size / 0x800 for each file
-It is possible to unpack and repack .mdt files with the pzztool.py tool. Unlike pzz, the internal files of .mdt are packaged and not compressed.
+== mdt files: ==
+MDT files are same than [[PZZ (Gotcha Force)|pzz]] but packing only uncompressed files.
-* 000 dat file contains at first sight all texts of the game (Related to all dat file, not only one).
+MDT header (0x800 bytes):
+* 4 bytes - uint32 file count (big endian)
+* 4 bytes - uint32 array: file size divided by 0x800 and upper rounded for each file
+All .mdt contains 2 files:
+* 000 dat file with texts used in the game
 * 001 TPL including an alphanumeric alphabet used by the dat file
-dat file header:
+== dat file: ==
-* 4 bytes - header_length
+=== header: ===
+* 4 bytes - paragraph_offsets_blocks_list offset
 * 4 bytes - symbols_count # number of symbols contained in the tpl
-* 2 bytes - offsets for each symbol - in the file relative to after the header (array)
+* 2 bytes - universal symbols ID shared between all .mdt and positional from tpl file
-dat file block 1:
-* 4 bytes - offset where the next block begin
+=== paragraph_offsets_blocks_list: ===
-* 4 bytes - unknown
+* 4 bytes - uint32 absolute offsets - array of paragraph_offsets_block terminated by -1 (FF FF FF FF)
-dat file block 2:
+=== paragraph_offsets_block 0: ===
-* 4 bytes - uint32 - list of offsets relative to this file ?
+# first paragraph_offsets_block describing all paragraph offsets in the data block 0
+* 4 bytes - int32 array of absolute offsets which point to pragraphs in the data block terminated by -1 (FF FF FF FF). The first offset is the data block offset.
+=== data_block 0: ===
+* 2 bytes - signed int16 list <- paragraph_offsets_block 0 describe where each paragraph begins.
+paragraph_offsets_block 1: ...
+data_block 1: ...
+=== align: ===
+The header length is aligned to 32 bytes and if there is no pad we add 32 bytes of pad. paragraphs_offsets_block are aligned to 16 bytes. Each paragraph is aligned to 32 bytes.
+== Text engine NTSC/USA ==
+data_block contains int16 indexes relative to the tpl symbol list. First symbol in the tpl has number 0 then horizontally symbol 1, 2 ... So data block(s) contains texts.
+* FFyy: <- each negative value describes something not present in the tpl
+** FFFE: Appends a whitespace character to the text.
+** FFFD: Appends a small whitespace character to the text.
+* 10yy:
+** 1000: Appends a newline character to the text.
+** 1001: Marks the end of the text. All text must end with this.
+* 80yy:
+** 8000: Appends the Player's name to the text.
+** 8001: Appends an unidentified string to the text.
+** 8002 (1 param): Applies a color preset from the param index to the text. An 0xFFFF param disables it.
+** 8003 (1 param): Appends an unidentified string from the param index to the text.
+The text engine uses a hash table corresponding to symbols and allow to color the textures or use vars like player name. This hashtable is also used in the dol data sections for instance to encode texts.
+Each paragraph begins with:
+* 2 bytes - total_paragraph_len - starting after the paragraph header and uint16 count
+* 2 bytes - total_lines_count
+* 2 bytes - max_lines_len - including the last \x10\x01
-Virtual World RE has developed the python script [https://github.com/Virtual-World-RE/NeoGF/tree/main/pzztool pzztool.py] to manipulate MDT files and their internal files.
+Here is the hashtable:
+43 ,
+44 .
+45 °
+46 :
+47 ;
+48 ?
+49 !
+51 _
+5e /
+65 `
+66 '
+68 "
+69 (
+6a )
+7b +
+7c -
+7e ×
+80 ÷
+81 =
+83 <
+84 >
+93 %
+94 #
+95 &
+96 *
+97 @
+a5 `
+4f 0
+50 1
+51 2
+52 3
+53 4
+54 5
+55 6
+56 7
+57 8
+58 9
+60 A
+61 B
+62 C
+63 D
+64 E
+65 F
+66 G
+67 H
+68 I
+69 J
+6a K
+6b L
+6c M
+6d N
+6e O
+6f P
+70 Q
+71 R
+72 S
+73 T
+74 U
+75 V
+76 W
+77 X
+78 Y
+79 Z
+81 a
+82 b
+83 c
+84 d
+85 e
+86 f
+87 g
+88 h
+89 i
+8a j
+8b k
+8c l
+8d m
+8e n
+8f o
+90 p
+91 q
+92 r
+93 s
+94 t
+95 u
+96 v
+97 w
+98 x
+99 y
+9a z
+bf `
+ce `
+c ba `
+c d5 `
+e e9 `
+9d `
+c2 `
+b4 `
+92 `
+90 `
+b4 `
+Virtual World RE has developed the python script [https://github.com/Virtual-World-RE/NeoGF/tree/main/mdttool mdttool.py] to manipulate MDT files and their internal files allowing to unpack / pack mdt files.
-PGCD of MDT file sizes: 2048 / 0x800
 [[Category:File format]]
 [[Category:Gotcha Force]]