To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?泣??淫??域??悠? 1110010011101000100000101110101000111111100010111000001100111111001111111000100011111010001111110011111110001000111001100011111100111111100101110100100100111111 e4e882ea3f8b833f3f88fa3f3f88e63f3f97493f
EUC-JP 蒻れ?泣??淫??域??悠? 1110100011101010101001001110110000111111101101011110001100111111001111111011000011111100001111110011111110110000111010000011111100111111110011011010101000111111 e8eaa4ec3fb5e33f3fb0fc3f3fb0e83f3fcdaa3f
UTF-8 蒻れ슦泣길룚淫딇닞域밟뫁悠욪 111010001001001010111011111000111000001010001100111011001000101010100110111001101011001110100011111010101011100010111000111010111010001110011010111001101011011110101011111010111001010010000111111010111000101110011110111001011001111110011111111010111011000010011111111010111010101110000001111001101000001010100000111011001001101010101010 e892bbe3828cec8aa6e6b3a3eab8b8eba39ae6b7abeb9487eb8b9ee59f9febb09febab81e682a0ec9aaa
UHC 蒻れ슦泣길룚淫딇닞域밟뫁悠욪 11100101101101101010101011101100100110101011000011101011111010001011000111100110100011111001011011101011111000101000101011101101100010001001111011100110101101001011100111100010100100011010010111101010111011011001111101000010 e5b6aaec9ab0ebe8b1e68f96ebe28aed889ee6b4b9e291a5eaed9f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)