To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 猥??乙??癲??溢??????怨??B 11100000110011100011111100111111100010011011001100111111001111111110000110011111001111110011111110001000111011000011111100111111001111110011111100111111001111111000100110000101001111110011111101000010 e0ce3f3f89b33f3fe19f3f3f88ec3f3f3f3f3f3f89853f3f42
EUC-JP 猥??乙??癲??溢??瓘???怨??B 111000001101000000111111001111111011001010110101001111110011111111100010101000010011111100111111101100001110111000111111001111111000111111001100111011110011111100111111001111111011000111100101001111110011111101000010 e0d03f3fb2b53f3fe2a13f3fb0ee3f3f8fccef3f3f3fb1e53f3f42
UTF-8 猥롢뀧乙대뎔癲뗣꺈溢잒굢瓘劉사윢怨뚰맜B 11100111100011001010010111101011101000011010001011101011100000001010011111100100101110011001100111101011100011001000000011101011100011101001010011100111100110011011001011101011100101111010001111101010101110101000100011100110101110101010001011101100100111101001001011101010101101011010001011100111100100111001100011101111101001111000011111101100100000101010110011101100100111001010001011100110100000001010100011101011100110101011000011101011101001111001110001000010 e78ca5eba1a2eb80a7e4b999eb8c80eb8e94e799b2eb97a3eaba88e6baa2ec9e92eab5a2e79398efa787ec82acec9ca2e680a8eb9ab0eba79c42
UHC 猥롢뀧乙대뎔癲뗣꺈溢잒굢瓘劉사윢怨뚰맜B 111010001110010110001110111000111000010110011110111010111110000010110100111010111011010110110000111011111010011010001011111000111000001110101111111011001110111010011111111010001000001010001001110011101011011011101010111001011011101111100111100111111010001111101010101100111000110011101101100100001010101101000010 e8e58ee3859eebe0b4ebb5b0efa68be383afecee9fe88289ceb6eae5bbe79fa3eab38ced90ab42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)