Which bit string does each character set encode a given character (or characters) to?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??椅??濡μ?巍ル?裕???????Ⅲ 1110000110011111001111110011111110001000110101100011111100111111100101000100011110000011110010100011111110011011110110011000001110001011001111111001011101010100001111110011111100111111001111110011111100111111001111111000011101010110 e19f3f3f88d63f3f944783ca3f9bd9838b3f97543f3f3f3f3f3f3f8756
EUC-JP 癲??椅??濡μ?巍ル?裕??洧????? 111000101010000100111111001111111011000011011000001111110011111111000111101010001010011011001100001111111101011011011011101001011110101100111111110011011011010100111111001111111000111111000111101101000011111100111111001111110011111100111111 e2a13f3fb0d83f3fc7a8a6cc3fd6dba5eb3fcdb53f3f8fc7b43f3f3f3f3f
UTF-8 癲ㅻ슡椅썲뵱濡μ돺巍ル쵑裕곫갭洧붿맻連얠Ⅲ 1110011110011001101100101110001110000101101110111110110010001010101000011110011010100100100001011110110010001101101100101110101110110101101100011110011010111111101000011100111010111100111010111000111110111010111001011011011110001101111000111000001110101011111011001011010110010001111010001010001110010101111010101011001110101011111010101011000010101101111001101011010010100111111010111011011010111111111010111010011110111011111011111010011010011010111011001001011010100000111000101000010110100010 e799b2e385bbec8aa1e6a485ec8db2ebb5b1e6bfa1cebceb8fbae5b78de383abecb591e8a395eab3abeab0ade6b4a7ebb6bfeba7bbefa69aec96a0e285a2
UHC 癲ㅻ슡椅썲뵱濡μ돺巍ル쵑裕곫갭洧붿맻連얠Ⅲ 111011111010011010100100111010111001101010101101111010111111010110111101111001011001010010101111111010111010000110100101111011001000100110111101111010001110010010101011111010111010110010010011111010111010111010000001111001101011000010111000111010101111101110010100111011001001000010111100111001101110011010111110111011001010010110110010 efa6a4eb9aadebf5bde594afeba1a5ec89bde8e4abebac93ebae81e6b0b8eafb94ec90bce6e6beeca5b2

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
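
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the `CHARSETS` mapping are hypothetical, the codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) are Python's standard codec names chosen as the closest match, and replacing unencodable characters with `?` (hex `3f`) is an assumption based on the `3f` bytes visible in the table.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent: Unified Hangul Code
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset.

    Characters a charset cannot represent are replaced with '?' (0x3f),
    which is an assumption about how the page handles them.
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

Results for characters outside the common repertoire may differ from the table, since vendor-specific mappings vary between implementations.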