To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 裝脛???址??址 11100101111001001110001111111000001111110011111100111111100110101010110000111111001111111001101010101100 e5e4e3f83f3f3f9aac3f3f9aac
EUC-JP 裝脛???址??址 11101010111001101110011011111010001111110011111100111111110101001010111000111111001111111101010010101110 eae6e6fa3f3f3fd4ae3f3fd4ae
UTF-8 裝脛렊흗歷址흗歷址 111010001010001110011101111010001000010010011011111010111010000010001010111011011001110110010111111001101010110110110111111001011001110110000000111011011001110110010111111001101010110110110111111001011001110110000000 e8a39de8849beba08aed9d97e6adb7e59d80ed9d97e6adb7e59d80
UHC 裝脛렊흗歷址흗歷址 111011011111101111001100111010111000111010100001110010001110100111010101111101101111001010100011110010001110100111010101111101101111001010100011 edfbcceb8ea1c8e9d5f6f2a3c8e9d5f6f2a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)