To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 蘂??甘v蘂??甘vB 111001010100000100111111001111111000101011000011011101101110010101000001001111110011111110001010110000110111011001000010 e5413f3f8ac376e5413f3f8ac37642
EUC-JP 蘂??甘v蘂??甘vB 111010011010001000111111001111111011010011000101011101101110100110100010001111110011111110110100110001010111011001000010 e9a23f3fb4c576e9a23f3fb4c57642
UTF-8 蘂뜯뮫甘v蘂뜯뮫甘vB 111010001001100010000010111010111001110010101111111010111010111010101011111001111001010010011000011101101110100010011000100000101110101110011100101011111110101110101110101010111110011110010100100110000111011001000010 e89882eb9cafebaeabe7949876e89882eb9cafebaeabe794987642
UHC 蘂뜯뮫甘v蘂뜯뮫甘vB 11100111110111101011011011100010100100101011010111001010111101100111011011100111110111101011011011100010100100101011010111001010111101100111011001000010 e7deb6e292b5caf676e7deb6e292b5caf67642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)