To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋??泣??惟??櫻??寃?? 1001101011001000001111110011111110001011100000110011111100111111100010001101001000111111001111111001111101001110001111110011111110011011100000110011111100111111 9ac83f3f8b833f3f88d23f3f9f4e3f3f9b833f3f
EUC-JP 塋??泣??惟??櫻??寃?? 1101010011001010001111110011111110110101111000110011111100111111101100001101010000111111001111111101110110101111001111110011111111010101111000110011111100111111 d4ca3f3fb5e33f3fb0d43f3fddaf3f3fd5e33f3f
UTF-8 塋딄랬泣덂컜惟듭뒳櫻뗫툥寃쀧독 111001011010000110001011111010111001010010000100111010111001111010101100111001101011001110100011111010111000110110000010111011001011101110011100111001101000001110011111111010111001001110101101111010111001001010110011111001101010101110111011111010111001011110101011111011011000100010100101111001011010111110000011111011001000000010100111111010111000111110000101 e5a18beb9484eb9eace6b3a3eb8d82ecbb9ce6839feb93adeb92b3e6abbbeb97abed88a5e5af83ec80a7eb8f85
UHC 塋딄랬泣덂컜惟듭뒳櫻뗫툥寃쀧독 111001111010101110001010111010101011011110101000111010111110100010001000111001011011000010000111111010101110111010110101111011001000101010101100111001011010000110001011111010111011100010011100111010101011001010010111111001111011010110110110 e7ab8aeab7a8ebe888e5b087eaeeb5ec8aace5a18bebb89ceab297e7b5b6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)