To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????}????{^ 0011111100111111001111110011111101111101001111110011111100111111001111110111101101011110 3f3f3f3f7d3f3f3f3f7b5e
SJIS-WIN ??碩荏}??碩荏{^ 001111110011111110010000110101111000100101100000011111010011111100111111100100001101011110001001011000000111101101011110 3f3f90d789607d3f3f90d789607b5e
EUC-JP ??碩荏}??碩荏{^ 001111110011111111000000110110011011000111000001011111010011111100111111110000001101100110110001110000010111101101011110 3f3fc0d9b1c17d3f3fc0d9b1c17b5e
UTF-8 솔센碩荏}솔센碩荏{^ 111011001000011010010100111011001000010010111100111001111010001010101001111010001000110110001111011111011110110010000110100101001110110010000100101111001110011110100010101010011110100010001101100011110111101101011110 ec8694ec84bce7a2a9e88d8f7dec8694ec84bce7a2a9e88d8f7b5e
UHC 솔센碩荏}솔센碩荏{^ 10111100110101101011110010111110111000001011010111101100111110110111110110111100110101101011110010111110111000001011010111101100111110110111101101011110 bcd6bcbee0b5ecfb7dbcd6bcbee0b5ecfb7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)