How to read Chinese characters from DICOM


#1 Post by **vizlab** » Tue, 2011-07-26, 15:04

I have some studies in which the patient name, study description and referring physician name are in Chinese. I am using findAndGetOFString to get the values of these DICOM tags. I suspect that the method does not return a Unicode or wide string, but rather an ASCII string in which some characters are replaced with *, ? or .

Is it possible to reconstruct the original string from the result of the findAndGetOFString method? If not, what should I do to get these DICOM tag values?

#2 Post by **Michael Onken** » Tue, 2011-07-26, 15:41

Hi,

findAndGetOFString gives you each 8-bit character as is, without any character set interpretation. You must interpret it according to the character set defined in the Specific Character Set (0008,0005). There is no functionality for doing character set conversions (e.g. from specific Japanese character sets to UTF-8).

For a list of the character sets possible in DICOM, look in part 3. It seems to me that for Chinese you basically have a choice between GB18030 and Unicode (UTF-8).

Also look into part 5 for examples of how to encode Chinese patient names in Unicode in DICOM.

Michael
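To make the step "interpret it according to the Specific Character Set" a bit more concrete, here is a minimal C++ sketch that maps a few (0008,0005) values to the encoding names that a converter such as iconv commonly accepts. The function name `encodingForCharset` and the table are illustrative assumptions and not part of DCMTK; the table is deliberately incomplete (for example, it has no ISO 2022 code extensions).

```cpp
#include <map>
#include <string>

// Hypothetical helper (not a DCMTK function): map a Specific Character Set
// (0008,0005) value to a commonly used encoding name.
// Returns an empty string for values this sketch does not cover.
std::string encodingForCharset(const std::string &definedTerm)
{
    static const std::map<std::string, std::string> table = {
        {"",           "ASCII"},       // attribute absent or empty: default repertoire
        {"ISO_IR 100", "ISO-8859-1"},  // Latin alphabet No. 1
        {"ISO_IR 192", "UTF-8"},       // Unicode in UTF-8
        {"GB18030",    "GB18030"},     // Chinese national standard
    };
    const auto it = table.find(definedTerm);
    return it != table.end() ? it->second : std::string();
}
```

The raw bytes returned by findAndGetOFString would then be passed, together with this encoding name, to whatever conversion routine the application uses. An empty result signals that the character set needs separate handling.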

#3 Post by **J. Riesmeier** » Tue, 2011-11-01, 18:10

As a follow-up: Today, we've completed a first version of an enhanced character set support for the DCMTK. This also includes the ISO 2022 code extension technique. If compiled with "libiconv", all affected strings in a DICOM dataset using any of the DICOM character sets can now be converted to UTF-8 (see "dcmconv --convert-to-utf8"). As already mentioned, this is only a first step of enhanced character set support ...

If you are interested in details, check our public git repository!
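Note that a string converted to UTF-8 is still a sequence of bytes. An application that needs code points (for instance to fill a wide string) has to decode it. The following is a minimal, DCMTK-independent sketch of such a decoder; the name `decodeUtf8` is made up for illustration, and the check is simplified (overlong encodings and surrogate code points are not rejected).

```cpp
#include <cstddef>
#include <stdexcept>
#include <string>
#include <vector>

// Decode a UTF-8 byte string into Unicode code points.
// Throws std::runtime_error on truncated or malformed input.
// Simplified sketch: overlong forms and surrogates are not rejected.
std::vector<char32_t> decodeUtf8(const std::string &bytes)
{
    std::vector<char32_t> out;
    std::size_t i = 0;
    while (i < bytes.size()) {
        const unsigned char lead = static_cast<unsigned char>(bytes[i]);
        std::size_t extra = 0;  // number of continuation bytes expected
        char32_t cp = 0;        // code point being assembled
        if (lead < 0x80)                { extra = 0; cp = lead; }
        else if ((lead & 0xE0) == 0xC0) { extra = 1; cp = lead & 0x1F; }
        else if ((lead & 0xF0) == 0xE0) { extra = 2; cp = lead & 0x0F; }
        else if ((lead & 0xF8) == 0xF0) { extra = 3; cp = lead & 0x07; }
        else throw std::runtime_error("invalid UTF-8 lead byte");

        if (i + extra >= bytes.size())
            throw std::runtime_error("truncated UTF-8 sequence");

        for (std::size_t k = 1; k <= extra; ++k) {
            const unsigned char c = static_cast<unsigned char>(bytes[i + k]);
            if ((c & 0xC0) != 0x80)
                throw std::runtime_error("invalid UTF-8 continuation byte");
            cp = (cp << 6) | (c & 0x3F);  // append the 6 payload bits
        }
        out.push_back(cp);
        i += extra + 1;
    }
    return out;
}
```

For example, the three bytes E7 8E 8B decode to the single code point U+738B, which is the character 王.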

#4 Post by **kamil** » Thu, 2014-05-15, 03:05

J. Riesmeier wrote: As a follow-up: Today, we've completed a first version of an enhanced character set support for the DCMTK. This also includes the ISO 2022 code extension technique. If compiled with "libiconv", all affected strings in a DICOM dataset using any of the DICOM character sets can now be converted to UTF-8 (see "dcmconv --convert-to-utf8"). As already mentioned, this is only a first step of enhanced character set support ...

If you are interested in details, check our public git repository!

Hi, I want to find a way to compile libiconv for use with DCMTK 3.6.1. Here is my situation; any suggestions? Thanks
