Reverse Engineering Encoding Format

If the code you want to reverse engineer is written in Unicode or a multibyte character set (MBCS), use the encoding parameters provided in the File Encoding box.

If you know which encoding the sources use and want to change these parameters, click the Ellipsis button beside the File Encoding box. This opens the Text Input Encoding Format dialog box, in which you can select the encoding format of your choice.



The Text Input Encoding Format dialog box includes the following options:

Option

Description

Encoding hint

Encoding format to be used as a hint when reverse engineering the file.

Detection mode

Indicates whether text encoding detection is to be attempted and specifies how much of each file should be analyzed. When enabled, PowerDesigner analyzes a portion of the text and uses a heuristic based on illegal byte sequences and/or the presence of encoding-specific tags to detect the appropriate encoding for reading the text.

The following settings are available:

  • No detection - for use when you know what the encoding format is

  • Quick detection - analyzes a small part of the file. For use when you think that the encoding format will be easy to detect

  • Full detection - analyzes the whole file. For use when you think that only a very small number of characters determine the encoding format

On ambiguous detection

Specifies what action should be taken in case of ambiguity. The following settings are available:

  • Use encoding hint and display warning - the encoding hint format is used and a warning message is displayed.

  • Use encoding hint - the encoding hint format is used but no warning message is displayed.

  • Use detected encoding - the encoding format detected by PowerDesigner is used

Abort on character loss

Stops reverse engineering if characters cannot be identified and would be lost in the current encoding.
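
To make the interaction between these options more concrete, the following is a minimal Python sketch of how such a detection policy could behave. It is not PowerDesigner's implementation, which is not documented here. The function names (detect, read_text), the option values ("none", "quick", "full", "hint_warn", "hint", "detected"), the 4096-byte sample size, and the choice to test only for byte-order marks and UTF-8 validity are all assumptions made for illustration.

```python
import codecs
import warnings

# Byte-order marks (a kind of encoding-specific tag). UTF-32 entries come
# first so that UTF-32 LE (FF FE 00 00) is not mistaken for UTF-16 LE (FF FE).
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def detect(data, mode="quick", sample_size=4096):
    """Hypothetical detector: return (encoding_or_None, certain)."""
    if mode == "none":
        return None, False
    # Quick detection looks at a prefix only; full detection reads everything.
    sample = data if mode == "full" else data[:sample_size]
    for bom, name in _BOMS:
        if sample.startswith(bom):
            return name, True
    # If the sample was cut short, tolerate a multi-byte sequence split at its edge.
    whole_file = len(sample) == len(data)
    try:
        codecs.getincrementaldecoder("utf-8")().decode(sample, final=whole_file)
    except UnicodeDecodeError:
        return None, False      # illegal UTF-8 byte sequence: not UTF-8
    if sample.isascii():
        return "utf-8", False   # no distinguishing bytes seen: ambiguous guess
    return "utf-8", True


def read_text(data, hint="cp1252", mode="quick",
              on_ambiguous="hint_warn", abort_on_loss=True):
    """Hypothetical reader: return (text, encoding_used)."""
    detected, certain = detect(data, mode)
    if certain:
        encoding = detected
    elif on_ambiguous == "detected" and detected is not None:
        encoding = detected
    else:
        encoding = hint
        if on_ambiguous == "hint_warn" and mode != "none":
            warnings.warn(f"ambiguous encoding, using hint {hint!r}")
    # Abort on character loss: raise instead of substituting U+FFFD.
    errors = "strict" if abort_on_loss else "replace"
    return data.decode(encoding, errors=errors), encoding
```

In this sketch, a file whose only non-ASCII character lies beyond the sampled prefix is reported as ambiguous under quick detection but as UTF-8 under full detection, which mirrors the guidance above on when to choose full detection.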

Here is an example of how to read encoding formats from the list: