Books
in black and white
Main menu
Share a book About us Home
Books
Biology Business Chemistry Computers Culture Economics Fiction Games Guide History Management Mathematical Medicine Mental Fitnes Physics Psychology Scince Sport Technics
Ads

Teradata RDBMS forUNIX SQL Reference - NCR

NCR Teradata RDBMS forUNIX SQL Reference - NCR, 1997. - 913 p.
Download (direct link): teradataforunix1997.pdf
Previous << 1 .. 220 221 222 223 224 225 < 226 > 227 228 229 230 231 232 .. 241 >> Next


Note: When hashfuncDBC is Kanji, the Teradata RDBMS always expects CHAR or VARCHAR data to be mixed single byte/multibyte characters. This is true for all session character sets, including standard EBCDIC and ASCII.

G-40

Teradata RDBMS for UNIX SQL Reference
International and Japanese Character Support

Japanese Character Support

The name, identifier, and description of each of the predefined character sets are given in Table G-2. Data handling of character sets for each environment is explained in the following passages. Encoding of each character set is given in Chapter H, “Japanese Character Sets.”

Each name suffix (_0X) is an integral part of the character set name and must be included in the <name> specification of a BTEQ [.]SET SESSION CHARSET command or a CLIv2 CHARSET call

Table G-2

Teradata RDBMS Kanji Character Sets

Character Set Name Character Set Id Description
KATAKANAEBCDIC 111 Based on the IBM Japanese Katakana EBCDIC. Corresponds to the following IBM Coded Character Set IDs (CCSIDs): CCSID=4386 (mixed single byte character/ multibyte character data) CCSID=13218 (single byte character data) GCCSID=4396 (graphic data)
KANJIEBCDIC5026_0I 112 Based on the IBM Japanese Extended Katakana character set. Corresponds to the following IBM CCSIDs: MCCSID=5026 (mixed single byte character/ multibyte character data) CCSID=290 (single byte character data) GCCSID=4396 (graphic data)
KANJIEBCDIC5035_0I 113 Based on the IBM Japanese Extended English character set. Corresponds to the following IBM CCSIDs: MCCSID=5035 (mixed single byte character/ multibyte character data) CCSID=1027 (single byte character data) GCCSID=4396 (graphic data)
KANJIEUC_0U 118 Emulates the standard Extended Unix Code style of mixed single byte character/multibyte character data, where the most significant bit of each byte classifies the byte as a single byte character or part of a multibyte character.
KANJISJIS_0S 119 Provided for PC platforms running DOS/V Windows. Emulates the Shift-JIS style of mixed single byte character/multibyte character data, where the range of each byte classifies the byte as a single byte character or part of a multibyte character.

Teradata RDBMS for UNIX SQL Reference

G-41
International and Japanese Character Support

IBM-Compatible Character Sets

IBM-Compatible Character Sets

In this document, the term KanjiEBCDIC is used to refer collectively to the three character sets provided for the IBM VM and MVS environments (KANJIEBCDIC5026_0I, KANJIEBCDIC5035_0I, and KATAKANAEBCDIC).

These character sets emulate the IBM style, where the multibyte character sequence in a character string is bracketed by Shift-Out and Shift-In characters.

The data handling for each set is described in the pages that follow.

G-42

Teradata RDBMS for UNIX SQL Reference
International and Japanese Character Support

KANJIEBCDIC5026 0I.

KANJIEBCDIC5026_0I.

This character set is based on the IBM Japanese Extended Katakana character set. It corresponds to the following three IBM Character Code Set Identifiers (CcSIDs):

• MCCSID=5026 (mixed single byte character/multibyte character data)

• CCSID=290 (single byte character data)

• GCCSID=4396 (graphic data)

Under this character set, a CHARACTER data type supports only mixed single byte characters/multibyte characters. Accepted characters include:

• simple lowercase and uppercase Latin letters

• digits

• other SQL terminal symbols

• single byte character Hankaku Katakana symbols

• various other double byte Kanji characters.

Teradata RDBMS for UNIX SQL Reference

G-43
International and Japanese Character Support

KANJIEBCDIC5035_0I

KANJIEBCDIC5035_0I

This character set is based on the IBM Japanese Extended English character set. It corresponds to the following three IBM Character Code Set Identifiers (CcSIDs):

• MCCSID=5035 (mixed single byte character/multibyte character data)

• CCSID=1027 (single byte character data)

• GCCSID=4396 (graphic data)

Under this character set, a CHARACTER data type supports only mixed single byte characters/multibyte characters. Accepted characters include simple lowercase and uppercase Latin letters, digits, and other SQL terminal symbols; single byte character Hankaku Katakana symbols; double byte Kanji characters, and various other characters.

This character set differs from KANJIEBCDIC5026_0I in the encoding of Katakana symbols and lowercase Latin letters.

This character set supports the same characters as KANJIEBCDIC5026_0I; however, it differs in the encoding of lowercase Latin letters and Hankaku Katakana characters.

G-44

Teradata RDBMS for UNIX SQL Reference
International and Japanese Character Support

KATAKANAEBCDIC

KATAKANAEBCDIC

This character set supports the same characters as KanjiEBCDIC5026-0I, except that it lacks lowercase single byte Latin letters.

This character set is based on Japanese Katakana EBCDIC. It corresponds to the following IBM Character Code Set Identifiers (CCSIDs):
Previous << 1 .. 220 221 222 223 224 225 < 226 > 227 228 229 230 231 232 .. 241 >> Next