To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 雍??巍??葉??暎?????宥??耀??B 11101000101101000011111100111111100110111101100100111111001111111001011101110100001111110011111110011101111100110011111100111111001111110011111100111111100101110100011100111111001111111001011101110011001111110011111101000010 e8b43f3f9bd93f3f97743f3f9df33f3f3f3f3f97473f3f97733f3f42
EUC-JP 雍??巍??葉??暎?????宥??耀??B 11110000101101100011111100111111110101101101101100111111001111111100110111010101001111110011111111011010111101010011111100111111001111110011111100111111110011011010100000111111001111111100110111010100001111110011111101000010 f0b63f3fd6db3f3fcdd53f3fdaf53f3f3f3f3fcda83f3fcdd43f3f42
UTF-8 雍됰젙巍띾떩葉띌쪛暎녘펹溜뺡냽宥사㉥耀붺껙B 11101001100110111000110111101011100100001011000011101100101000001001100111100101101101111000110111101011100111011011111011101011100101101010100111101000100100011000100111101011100111011000110011101100101010101001101111100110100110101000111011101011100001011001100011101101100011101011100111101111101001111000101111101011101110101010000111101011100000111011110111100101101011101010010111101100100000101010110011100011100010011010010111101000100000001000000011101011101101101011101011101010101110111001100101000010 e99b8deb90b0eca099e5b78deb9dbeeb96a9e89189eb9d8cecaa9be69a8eeb8598ed8eb9efa78bebbaa1eb83bde5aea5ec82ace389a5e88080ebb6baeabb9942
UHC 雍됰젙巍띾떩葉띌쪛暎녘펹溜뺡냽宥사㉥耀붺껙B 11101000101111001000100111101011101000001001010111101000111001001000110111101011100010111011101111100111101010001011011011101001101001011001010011100111101100101011001111101000101111001000100111101010111111101001010111101001100001101000110111101010111010011011101111100111101010001011011011101001101001011001010011100111101100101011001101000010 e8bc89eba095e8e48deb8bbbe7a8b6e9a594e7b2b3e8bc89eafe95e9868deae9bbe7a8b6e9a594e7b2b342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)