To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 趙???伊???精v趙???伊???精vB 111001101110001000111111001111110011111110001000110010010011111100111111001111111001000010111000011101101110011011100010001111110011111100111111100010001100100100111111001111110011111110010000101110000111011001000010 e6e23f3f3f88c93f3f3f90b876e6e23f3f3f88c93f3f3f90b87642
EUC-JP 趙???伊???精v趙???伊???精vB 111011001110010000111111001111110011111110110000110010110011111100111111001111111100000010111010011101101110110011100100001111110011111100111111101100001100101100111111001111110011111111000000101110100111011001000010 ece43f3f3fb0cb3f3f3fc0ba76ece43f3f3fb0cb3f3f3fc0ba7642
UTF-8 趙얗렎렯伊브렣쾅精v趙얗렎렯伊브렣쾅精vB 111010001011011010011001111011001001011010010111111010111010000010001110111010111010000010101111111001001011110010001010111010111011100010001100111010111010000010100011111011001011111010000101111001111011001010111110011101101110100010110110100110011110110010010110100101111110101110100000100011101110101110100000101011111110010010111100100010101110101110111000100011001110101110100000101000111110110010111110100001011110011110110010101111100111011001000010 e8b699ec9697eba08eeba0afe4bc8aebb88ceba0a3ecbe85e7b2be76e8b699ec9697eba08eeba0afe4bc8aebb88ceba0a3ecbe85e7b2be7642
UHC 趙얗렎렯伊브렣쾅精v趙얗렎렯伊브렣쾅精vB 111100001110000110111110111010011000111010100100100011101011110011101100101001011011101011101010100011101011010011000100111001111110111111110001011101101111000011100001101111101110100110001110101001001000111010111100111011001010010110111010111010101000111010110100110001001110011111101111111100010111011001000010 f0e1bee98ea48ebceca5baea8eb4c4e7eff176f0e1bee98ea48ebceca5baea8eb4c4e7eff17642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)