What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ??????嫄???????????????? 00111111001111110011111100111111001111110011111110001111101110101010000100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f8fbaa13f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 溜삳젎溜븍눋嫄곕젛溜븍젉溜브굅溜쀫졋溜띾졋溜뺹 111011111010011110001011111011001000001010110011111011001010000010001110111011111010011110001011111010111011100010001101111010111000100010001011111001011010101110000100111010101011001110010101111011001010000010011011111011111010011110001011111010111011100010001101111011001010000010001001111011111010011110001011111010111011100010001100111010101011010110000101111011111010011110001011111011001000000010101011111011001010000110001011111011111010011110001011111010111001110110111110111011001010000110001011111011111010011110001011111010111011101010111001 efa78bec82b3eca08eefa78bebb88deb888be5ab84eab395eca09befa78bebb88deca089efa78bebb88ceab585efa78bec80abeca18befa78beb9dbeeca18befa78bebbab9
UHC 溜삳젎溜븍눋嫄곕젛溜븍젉溜브굅溜쀫졋溜띾졋溜뺹 11101010111111101011101111101011101000001000111111101010111111101011101011101011101101001010110011101010101100011011000011101011101000001001011111101010111111101011101011101011101000001000101111101010111111101011101011101010101100011011000011101010111111101001011111101011101000001011101011101010111111101000110111101011101000001011101011101010111111101001011001000010 eafebbeba08feafebaebb4aceab1b0eba097eafebaeba08beafebaeab1b0eafe97eba0baeafe8deba0baeafe9642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
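
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes in the table above.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, character, binary string, hex string) rows for text."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Eight binary digits per byte, concatenated without separators.
        binary = "".join(f"{byte:08b}" for byte in raw)
        # Decode again to show what survived the round trip.
        rows.append((label, raw.decode(codec), binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, binary, hexa in encode_table("あ"):
        print(label, shown, binary, hexa)
```

For the input "あ", the ISO-8859-1 row comes out as "?" with the byte 3f, while UTF-8 yields the three bytes e38182 and SJIS-WIN the two bytes 82a0.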