To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鴦???ゅ?湲??B 11101001111100010011111100111111001111111000001011100011001111111001111111010001001111110011111101000010 e9f13f3f3f82e33f9fd13f3f42
EUC-JP 鴦???ゅ?湲??B 11110010111100110011111100111111001111111010010011100101001111111101111011010011001111110011111101000010 f2f33f3f3fa4e53fded33f3f42
UTF-8 鴦볤막痢ゅ럳湲룹릉B 11101001101101001010011011101011101100111010010011101011101001111000100111101111101001111010010111100011100000101000010111101011100111111011001111100110101110011011001011101011101000111011100111101011101001101000100101000010 e9b4a6ebb3a4eba789efa7a5e38285eb9fb3e6b9b2eba3b9eba68942
UHC 鴦볤막痢ゅ럳湲룹릉B 11100100111011001001001111101010101110001011011111101100101110001010101011100101100011101001001111101010101110001011011111101100101110001010101001000010 e4ec93eab8b7ecb8aae58e93eab8b7ecb8aa42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)