To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 猥??踰??釉??爰???ル?B 111000001100111000111111001111111110011011111010001111110011111111100111110101100011111100111111111000001010011100111111001111110011111110000011100010110011111101000010 e0ce3f3fe6fa3f3fe7d63f3fe0a73f3f3f838b3f42
EUC-JP 猥??踰??釉??爰???ル?B 111000001101000000111111001111111110110011111100001111110011111111101110110110000011111100111111111000001010100100111111001111110011111110100101111010110011111101000010 e0d03f3fecfc3f3feed83f3fe0a93f3f3fa5eb3f42
UTF-8 猥롪봇踰껅략釉롩뇦爰용쾬曆ル툗B 11100111100011001010010111101011101000011010101011101011101101001000011111101000101110001011000011101010101110111000010111101011100111101011010111101001100001111000100111101011101000011010100111101011100001111010011011100111100010001011000011101100100110101010100111101100101111101010110011101111101001101000101111100011100000111010101111101101100010001001011101000010 e78ca5eba1aaebb487e8b8b0eabb85eb9eb5e98789eba1a9eb87a6e788b0ec9aa9ecbeacefa68be383abed889742
UHC 猥롪봇踰껅략釉롩뇦爰용쾬曆ル툗B 11101000111001011000111011101010101110101011111111101011101100101000001111100110101101111010101111101011101110001000111011101001100001111000111011101010101110101011111111101011101100101000001111100110101101111010101111101011101110001000111001000010 e8e58eeababfebb283e6b7abebb88ee9878eeababfebb283e6b7abebb88e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)