To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????G 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f47
SJIS-WIN ???違??蟻?????裕??循??鶯??G 001111110011111100111111100010001110000100111111001111111000101101100001001111110011111100111111001111110011111110010111010101000011111100111111100011110111101000111111001111111110100111110010001111110011111101000111 3f3f3f88e13f3f8b613f3f3f3f3f97543f3f8f7a3f3fe9f23f3f47
EUC-JP ???違??蟻??孼??裕??循??鶯??G 0011111100111111001111111011000011100011001111110011111110110101110000100011111100111111100011111011101011000011001111110011111111001101101101010011111100111111101111011101101100111111001111111111001011110100001111110011111101000111 3f3f3fb0e33f3fb5c23f3f8fbac33f3fcdb53f3fbddb3f3ff2f43f3f47
UTF-8 捻뀁궠違띷뤃蟻믪쭍孼꾠끇裕드춢循뗫걤鶯뱁꽮G 11101111101001101010010011101011100000001000000111101010101101101010000011101001100000011001010111101011100111011011011111101011101001001000001111101000100111111011101111101011101011111010101011101100101011011000110111100101101011011011110011101010101111101010000011101011100000011000011111101000101000111001010111101011100100111001110011101100101101101010001011100101101111101010101011101011100101111010101111101010101100011010010011101001101101101010111111101011101100011000000111101010101111011010111001000111 efa6a4eb8081eab6a0e98195eb9db7eba483e89fbbebafaaecad8de5adbceabea0eb8187e8a395eb939cecb6a2e5beaaeb97abeab1a4e9b6afebb181eabdae47
UHC 捻뀁궠違띷뤃蟻믪쭍孼꾠끇裕드춢循뗫걤鶯뱁꽮G 11100110111101111011001011101100100000101011001111101010110111101000110111100110100011111011010011101011111111001001001011101100101001111000011011100101111011011000010011100011100001011011101111101011101011101011010111100101101011011000001111100010111000001000101111101011100000011000110111100101101000111011100111101101100001001011100101000111 e6f7b2ec82b3eade8de68fb4ebfc92eca786e5ed84e385bbebaeb5e5ad83e2e08beb818de5a3b9ed84b947

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)