To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8踰??宋??沃??日??唯??肄? 11100001100111110011111110000010010101111110011011111010001111110011111110010001011101100011111100111111100101111000000000111111001111111001001111111010001111110011111110010111010000100011111100111111111000111110010100111111 e19f3f8257e6fa3f3f91763f3f97803f3f93fa3f3f97423f3fe3e53f
EUC-JP 癲?8踰??宋??沃??日??唯??肄? 11100010101000010011111110100011101110001110110011111100001111110011111111000001110101110011111100111111110011011110000000111111001111111100011011111100001111110011111111001101101000110011111100111111111001101110011100111111 e2a13fa3b8ecfc3f3fc1d73f3fcde03f3fc6fc3f3fcda33f3fe6e73f
UTF-8 癲쒕8踰뤻뇡宋믨콟沃섃뫜日뗥보唯몃뀪肄웏 111001111001100110110010111011001001001010010101111011111011110010011000111010001011100010110000111010111010010010111011111010111000011110100001111001011010111010001011111010111010111110101000111011001011110110011111111001101011001010000011111011001000010010000011111010111010101110011100111001101001011110100101111010111001011110100101111010111011001110110100111001011001010010101111111010111010101010000011111010111000000010101010111010001000001010000100111011001001101110001111 e799b2ec9295efbc98e8b8b0eba4bbeb87a1e5ae8bebafa8ecbd9fe6b283ec8483ebab9ce697a5eb97a5ebb3b4e594afebaa83eb80aae88284ec9b8f
UHC 癲쒕8踰뤻뇡宋믨콟沃섃뫜日뗥보唯몃뀪肄웏 11101111101001101001110011101011101000111011100011101011101100101000111111101001100001111000100111100001111001001001001011101010101100011001011111101000101010101001100011100010100100011011110011101100111011011000101111100101101110101011100011101010111001101011100011101011100001011010000011101100101111011001111101100001 efa69ceba3b8ebb28fe98789e1e492eab197e8aa98e291bceced8be5bab8eae6b8eb85a0ecbd9f61

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)