While e-discovery may be Greek to many, it is those documents written in Chinese, Cheap Rosetta Stone Japanese, Korean and Russian that cause much of the trouble. These multi-byte languages have exponentially more characters than the 26 letters and few other punctuation marks that Latin languages like English, Spanish, French and German need. In fact, the number of Chinese characters included in the Kangxi dictionary is over 47,000 (though only 3-4,000 are reportedly necessary for full literacy). The impact on e-discovery is significant considering the increased sophistication necessary for case evaluation.At the most basic level, computers think in ones and zeros, with a one or zero being a bit. Eight bits is a byte. There are 256 different combinations of numbers you can create using a byte (2 (bits) to the 8th power). For languages that are not based solely on letters, i.e., those where symbols represent a concept or a syllable, you need to add bytes (256 x 256, which equals 66,536). That is the essence of multi-byte vs. single-byte languages single-byte languages have 256 possible combinations, while multi-byte languages have 66,536.Confused? Then lets address Rosetta Stone Chinese codings. An encoding is a programmatical translation of what you input to what you get on the screen. The problem is when you have multiple encodings. For example, when analyzing an Outlook 2000 e-mail file (PST format) under a Japanese operating system, which you then convert to an English-language machine for review, there will be problems because the native data in Japanese is corrupted due to linguistic differences.Unicode was created to solve some of these problems and offer a universal solution; however, it is only available for files created on newer systems, making legacy data a continuing area of concern. Each language family has its own unique set of problems and solutions, says Thomas Barnett, Special Counsel for Sullivan Cromwell, LLP.In fact, in some parts of the world, you are not allowed to take the data out of the country due to local data protection laws, adds Brian Kim of PriceWaterhouseCoopers LLP. He highlights that certain countries also have native applications that are more popular than those commonly used in the United Rosetta Stone Japanese States, requiring additional evaluation of your program inventory.



0 评论:
发表评论