To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN ???恁れ????Lh???恁れ????L 00111111001111110011111110011100100011001000001011101010001111110011111100111111001111110100110001101000001111110011111100111111100111001000110010000010111010100011111100111111001111110011111101001100 3f3f3f9c8c82ea3f3f3f3f4c683f3f3f9c8c82ea3f3f3f3f4c
EUC-JP ???恁れ????Lh???恁れ????L 00111111001111110011111111010111111011001010010011101100001111110011111100111111001111110100110001101000001111110011111100111111110101111110110010100100111011000011111100111111001111110011111101001100 3f3f3fd7eca4ec3f3f3f3f4c683f3f3fd7eca4ec3f3f3f3f4c
UTF-8 梨낇삦恁れ콝吏뱁삛Lh梨낇삦恁れ콝吏뱁삛L 111011111010011110100010111010111000001010000111111011001000001010100110111001101000000110000001111000111000001010001100111011001011110110011101111011111010011110011110111010111011000110000001111011001000001010011011010011000110100011101111101001111010001011101011100000101000011111101100100000101010011011100110100000011000000111100011100000101000110011101100101111011001110111101111101001111001111011101011101100011000000111101100100000101001101101001100 efa7a2eb8287ec82a6e68181e3828cecbd9defa79eebb181ec829b4c68efa7a2eb8287ec82a6e68181e3828cecbd9defa79eebb181ec829b4c
UHC 梨낇삦恁れ콝吏뱁삛Lh梨낇삦恁れ콝吏뱁삛L 111011001011000110000101111011011001100010100101111011001111011010101010111011001011000110010101111011001010011110111001111011011001100010011110010011000110100011101100101100011000010111101101100110001010010111101100111101101010101011101100101100011001010111101100101001111011100111101101100110001001111001001100 ecb185ed98a5ecf6aaecb195eca7b9ed989e4c68ecb185ed98a5ecf6aaecb195eca7b9ed989e4c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)