To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 壤??泣??湲ε?醫??永??B 10011010110111110011111100111111100010111000001100111111001111111001111111010001100000111100001100111111111001111100111000111111001111111000100101101001001111110011111101000010 9adf3f3f8b833f3f9fd183c33fe7ce3f3f89693f3f42
EUC-JP 壤??泣??湲ε?醫??永??B 11010100111000010011111100111111101101011110001100111111001111111101111011010011101001101100010100111111111011101101000000111111001111111011000111001010001111110011111101000010 d4e13f3fb5e33f3fded3a6c53feed03f3fb1ca3f3f42
UTF-8 壤깆쥜泣당독湲ε쉽醫묒뒾永띔른B 111001011010001110100100111010101011100110000110111011001010010110011100111001101011001110100011111010111000101110111001111010111000111110000101111001101011100110110010110011101011010111101100100010011011110111101001100001101010101111101011101011001001001011101011100100101011111011100110101100001011100011101011100111011001010011101011101001011011100001000010 e5a3a4eab986eca59ce6b3a3eb8bb9eb8f85e6b9b2ceb5ec89bde986abebac92eb92bee6b0b8eb9d94eba5b842
UHC 壤깆쥜泣당독湲ε쉽醫묒뒾永띔른B 11100101101111011011000111101100101000101001000111101011111010001011010011100111101101011011011011101010101110001010010111100101101111011011000111101100101000101001000111101100100010101011010011100111101101011011011011101010101110001010010101000010 e5bdb1eca291ebe8b4e7b5b6eab8a5e5bdb1eca291ec8ab4e7b5b6eab8a542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)