To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癲??椅??鎖??偃???陰??怨??^ 1110000110011111001111110011111110001000110101100011111100111111100011011011110100111111001111111001100011101110001111110011111100111111100010010100000100111111001111111000100110000101001111110011111101011110 e19f3f3f88d63f3f8dbd3f3f98ee3f3f3f89413f3f89853f3f5e
EUC-JP 癲??椅??鎖??偃???陰??怨??^ 1110001010100001001111110011111110110000110110000011111100111111101110101011111100111111001111111101000011110000001111110011111100111111101100011010001000111111001111111011000111100101001111110011111101011110 e2a13f3fb0d83f3fbabf3f3fd0f03f3f3fb1a23f3fb1e53f3f5e
UTF-8 癲몃돆椅길뤃鎖듬짎偃김굜짰陰믧걬怨룸쨰^ 11100111100110011011001011101011101010101000001111101011100011111000011011100110101001001000010111101010101110001011100011101011101001001000001111101001100011101001011011101011100100111010110011101100101001111000111011100101100000011000001111101010101110011000000011101010101101011001110011101100101001111011000011101001100110011011000011101011101011111010011111101010101100011010110011100110100000001010100011101011101000111011100011101100101010001011000001011110 e799b2ebaa83eb8f86e6a485eab8b8eba483e98e96eb93aceca78ee58183eab980eab59ceca7b0e999b0ebafa7eab1ace680a8eba3b8eca8b05e
UHC 癲몃돆椅길뤃鎖듬짎偃김굜짰陰믧걬怨룸쨰^ 111011111010011010111000111010111000100110010111111010111111010110110001111001101000111110110100111000011111000010110101111010111010001110011010111001011110011110110001111010001000001010000100110000101010111011101011111001001001001011101001100000011001010111101010101100111011011111101011101001001000101001011110 efa6b8eb8997ebf5b1e68fb4e1f0b5eba39ae5e7b1e88284c2aeebe492e98195eab3b7eba48a5e
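
In the rows above, every "?" shows up as the byte 3f and the trailing "^" as 5e. A likely explanation is that characters a charset cannot represent are replaced by "?" before the bit string is produced. The sketch below reproduces that behavior with Python's built-in codecs. The function name encode_with_fallback is hypothetical, and errors="replace" is an assumption about how the tool behaves, not something the page confirms.

```python
def encode_with_fallback(text, codec):
    """Encode text, substituting "?" for characters the codec cannot represent."""
    # errors="replace" is assumed here to mimic the "?" bytes seen in the table
    return text.encode(codec, errors="replace")


sample = "癲^"

# ISO-8859-1 has no code for the kanji, so it falls back to "?" (0x3f)
print(encode_with_fallback(sample, "iso-8859-1").hex())

# EUC-JP and CP932 can represent it, so real two-byte codes appear
print(encode_with_fallback(sample, "euc_jp").hex())
print(encode_with_fallback(sample, "cp932").hex())
```

The first line printed ends in 5e for "^", matching the last byte of every row in the table.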

SJIS-Win, EUC-JP: Legacy character sets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
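
The conversion shown on this page can be approximated in a few lines of Python. This is a minimal sketch, not the tool's actual implementation. The names CODECS, to_bit_strings and convert are hypothetical, and the mapping from charset labels to Python codec names (SJIS-Win as cp932, UHC as cp949) is an assumption.

```python
# Map the charset labels used on this page to Python codec names.
# Assumption: SJIS-Win is treated as cp932 and UHC as cp949.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # Characters the charset cannot represent are replaced by "?" (assumed behavior)
    raw = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Calling convert("あ") returns one (binary, hexadecimal) pair per charset, in the same column order as the table above.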