What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??踰→????域??異?┼???倚 1110000110011111001111110011111111100110111110101000000110101000001111110011111100111111001111111000100011100110001111110011111110001000110110010011111110000100101010010011111100111111001111111001100011011111 e19f3f3fe6fa81a83f3f3f3f88e63f3f88d93f84a93f3f3f98df
EUC-JP 癲??踰→????域??異?┼洧??倚 11100010101000010011111100111111111011001111110010100010101010100011111100111111001111110011111110110000111010000011111100111111101100001101101100111111101010001010101110001111110001111011010000111111001111111101000011100001 e2a13f3fecfca2aa3f3f3f3fb0e83f3fb0db3fa8ab8fc7b43f3fd0e1
UTF-8 癲녴굥踰→칲類욌눤域밟뫁異듸┼洧쏅뿫倚 111001111001100110110010111010111000010110110100111010101011010110100101111010001011100010110000111000101000011010010010111011001011100110110010111011111010011110010000111011001001101010001100111010111000100010100100111001011001111110011111111010111011000010011111111010111010101110000001111001111001010110110000111010111001001110111000111000101001010010111100111001101011010010100111111011001000111110000101111010111011111110101011111001011000000010011010 e799b2eb85b4eab5a5e8b8b0e28692ecb9b2efa790ec9a8ceb88a4e59f9febb09febab81e795b0eb93b8e294bce6b4a7ec8f85ebbfabe5809a
UHC 癲녴굥踰→칲類욌눤域밟뫁異듸┼洧쏅뿫倚 1110111110100110100001101110001110000010100010111110101110110010101000011110011010101111100001011110101110111010100111101110101110000111101110111110011010110100101110011110001010010001101001011110110010110110101101011110111110100110101010111110101011111011100110111110101110010111101010111110101111101111 efa686e3828bebb2a1e6af85ebba9eeb87bbe6b4b9e291a5ecb6b5efa6abeafb9beb97abebef

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
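
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset names to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_to_bits and convert. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table above. Python's codecs may differ from this page's converter for some characters.

```python
# Assumed mapping from the table's charset names to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as implemented by Python
    "EUC-JP": "euc_jp",    # assumption: may differ from the page's EUC-JP variant
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is Windows code page 949
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_to_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for name, (binary, hexadecimal) in convert("あ").items():
        print(name, binary, hexadecimal)
```

With the input "あ" (U+3042), for instance, the UTF-8 row comes out as e38182 and the ISO-8859-1 row as 3f, since Latin-1 has no hiragana.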