What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?踰??喩?????猷??濡ロ?億?? 1110000110011111100000111000101100111111111001101111101000111111001111111001101001100111001111110011111100111111001111110011111110010111010100010011111100111111100101000100011110000011100011010011111110001001101011010011111100111111 e19f838b3fe6fa3f3f9a673f3f3f3f3f97513f3f9447838d3f89ad3f3f
EUC-JP 癲ル?踰??喩?????猷??濡ロ?億?? 1110001010100001101001011110101100111111111011001111110000111111001111111101001111001000001111110011111100111111001111110011111111001101101100100011111100111111110001111010100010100101111011010011111110110010101011110011111100111111 e2a1a5eb3fecfc3f3fd3c83f3f3f3f3fcdb23f3fc7a8a5ed3fb2af3f3f
UTF-8 癲ル슢踰딂짆喩묎슈銳얜틳猷뗩뼸濡ロ닕億됱룮 111001111001100110110010111000111000001110101011111011001000101010100010111010001011100010110000111010111001010010000010111011001010011110000110111001011001011010101001111010111010110010001110111011001000101010001000111010011000101010110011111011001001011010011100111011011000101110110011111001111000110010110111111010111001011110101001111010111011110010111000111001101011111110100001111000111000001110101101111010111000101110010101111001011000010010000100111010111001000010110001111010111010001110101110 e799b2e383abec8aa2e8b8b0eb9482eca786e596a9ebac8eec8a88e98ab3ec969ced8bb3e78cb7eb97a9ebbcb8e6bfa1e383adeb8b95e58484eb90b1eba3ae
UHC 癲ル슢踰딂짆喩묎슈銳얜틳猷뗩뼸濡ロ닕億됱룮 111011111010011010101011111010111001101010101110111010111011001010001010111010001010001110010101111010101110011110010001111010101011110110110100111001111110010110111110111010111011101010011011111010111010001110001011111010011001011010111011111010111010000110101011111011011000100010011001111001011110001010001001111011001000111110100100 efa6abeb9aaeebb28ae8a395eae791eabdb4e7e5beebba9beba38be996bbeba1abed8899e5e289ec8fa4

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).