To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???}v???}vB 0011111100111111001111110111110101110110001111110011111100111111011111010111011001000010 3f3f3f7d763f3f3f7d7642
SJIS-WIN 竊??}v竊??}vB 11100010100001100011111100111111011111010111011011100010100001100011111100111111011111010111011001000010 e2863f3f7d76e2863f3f7d7642
EUC-JP 竊??}v竊??}vB 11100011111001100011111100111111011111010111011011100011111001100011111100111111011111010111011001000010 e3e63f3f7d76e3e63f3f7d7642
UTF-8 竊뚢봺}v竊뚢봺}vB 1110011110101011100010101110101110011010101000101110101110110100101110100111110101110110111001111010101110001010111010111001101010100010111010111011010010111010011111010111011001000010 e7ab8aeb9aa2ebb4ba7d76e7ab8aeb9aa2ebb4ba7d7642
UHC 竊뚢봺}v竊뚢봺}vB 1110111110111100100011001110001010010100100000010111110101110110111011111011110010001100111000101001010010000001011111010111011001000010 efbc8ce294817d76efbc8ce294817d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)