To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 額?????冶??Lh額?????冶??L 10001010011110100011111100111111001111110011111100111111100101101110100000111111001111110100110001101000100010100111101000111111001111110011111100111111001111111001011011101000001111110011111101001100 8a7a3f3f3f3f3f96e83f3f4c688a7a3f3f3f3f3f96e83f3f4c
EUC-JP 額?????冶??Lh額?????冶??L 10110011110110110011111100111111001111110011111100111111110011001110101000111111001111110100110001101000101100111101101100111111001111110011111100111111001111111100110011101010001111110011111101001100 b3db3f3f3f3f3fccea3f3f4c68b3db3f3f3f3f3fccea3f3f4c
UTF-8 額ㅻ젡說븍젒冶먮젧Lh額ㅻ젡說븍젒冶먮젧L 111010011010000110001101111000111000010110111011111011001010000010100001111011111010011010100001111010111011100010001101111011001010000010010010111001011000011010110110111010111010100010101110111011001010000010100111010011000110100011101001101000011000110111100011100001011011101111101100101000001010000111101111101001101010000111101011101110001000110111101100101000001001001011100101100001101011011011101011101010001010111011101100101000001010011101001100 e9a18de385bbeca0a1efa6a1ebb88deca092e586b6eba8aeeca0a74c68e9a18de385bbeca0a1efa6a1ebb88deca092e586b6eba8aeeca0a74c
UHC 額ㅻ젡說븍젒冶먮젧Lh額ㅻ젡說븍젒冶먮젧L 111001001111111010100100111010111010000010011010111001101111001010111010111010111010000010010001111001011010011110010000111010111010000010011111010011000110100011100100111111101010010011101011101000001001101011100110111100101011101011101011101000001001000111100101101001111001000011101011101000001001111101001100 e4fea4eba09ae6f2baeba091e5a790eba09f4c68e4fea4eba09ae6f2baeba091e5a790eba09f4c

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
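
The conversion shown in the table above can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function name encode_to_bit_strings and the CHARSETS mapping are hypothetical, and the choice of Python codec for each charset label (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The pairing is an assumption; the page's real implementation is not known.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent become "?" (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Render every byte as exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "額Lh"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

With this sketch, encoding 額 with the cp932 codec gives 8a7a and with euc_jp gives b3db, the same leading bytes as the SJIS-WIN and EUC-JP rows of the table.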