To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鼇?????晤??循 11101010100001110011111100111111001111110011111100111111100111011110101100111111001111111000111101111010 ea873f3f3f3f3f9deb3f3f8f7a
EUC-JP 鼇?????晤??循 11110011111001110011111100111111001111110011111100111111110110101110110100111111001111111011110111011011 f3e73f3f3f3f3fdaed3f3fbddb
UTF-8 鼇믭쉿樂뽭쭠晤뚩뜵循 111010011011110010000111111010111010111110101101111011001000100110111111111011111010011010111111111010111011110110101101111011001010110110100000111001101001100110100100111010111001101010101001111010111001110010110101111001011011111010101010 e9bc87ebafadec89bfefa6bfebbdadecada0e699a4eb9aa9eb9cb5e5beaa
UHC 鼇믭쉿樂뽭쭠晤뚩뜵循 1110100010101000100100101110111110111101101100101110100011111001100101101110100110100111100101011110011111111011100011001110100010001101101100111110001011100000 e8a892efbdb2e8f996e9a795e7fb8ce88db3e2e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)