To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????h???? 001111110011111100111111001111110110100000111111001111110011111100111111 3f3f3f3f683f3f3f3f
SJIS-WIN 遜促他俗h遜促他俗 1001000110111011100100011010001110010001101111001001000110101101011010001001000110111011100100011010001110010001101111001001000110101101 91bb91a391bc91ad6891bb91a391bc91ad
EUC-JP 遜促他俗h遜促他俗 1100001010111101110000101010010111000010101111101100001010101111011010001100001010111101110000101010010111000010101111101100001010101111 c2bdc2a5c2bec2af68c2bdc2a5c2bec2af
UTF-8 遜促他俗h遜促他俗 11101001100000011001110011100100101111111000001111100100101110111001011011100100101111111001011101101000111010011000000110011100111001001011111110000011111001001011101110010110111001001011111110010111 e9819ce4bf83e4bb96e4bf9768e9819ce4bf83e4bb96e4bf97
UHC 遜促他俗h遜促他俗 1110000111100001111101011011010111110110111000101110000111010100011010001110000111100001111101011011010111110110111000101110000111010100 e1e1f5b5f6e2e1d468e1e1f5b5f6e2e1d4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)