To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ã󃋎Tã󃋎TB 11100011111100111000001110001011100011100101010011100011111100111000001110001011100011100101010001000010 e3f3838b8e54e3f3838b8e5442
SJIS-WIN ?????T?????TB 00111111001111110011111100111111001111110101010000111111001111110011111100111111001111110101010001000010 3f3f3f3f3f543f3f3f3f3f5442
EUC-JP ãó???Tãó???TB 100011111010101110101010100011111010101111010001001111110011111100111111010101001000111110101011101010101000111110101011110100010011111100111111001111110101010001000010 8fabaa8fabd13f3f3f548fabaa8fabd13f3f3f5442
UTF-8 ã󃋎Tã󃋎TB 1100001110100011110000111011001111000010100000111100001010001011110000101000111001010100110000111010001111000011101100111100001010000011110000101000101111000010100011100101010001000010 c3a3c3b3c283c28bc28e54c3a3c3b3c283c28bc28e5442
UHC ?????T?????TB 00111111001111110011111100111111001111110101010000111111001111110011111100111111001111110101010001000010 3f3f3f3f3f543f3f3f3f3f5442

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
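
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not the code behind this page. The helper name to_bits is hypothetical, and the Python codec names (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the rows of question marks in the table appear to reflect.

```python
def to_bits(text, charset, errors="replace"):
    """Encode text in the given charset; return (binary string, hex string).

    With errors="replace", unencodable characters become "?" (0x3f).
    """
    data = text.encode(charset, errors=errors)
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed Python codec names for the charsets in the table above.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")

for charset in CHARSETS:
    binary, hexadecimal = to_bits("TB", charset)
    print(charset, binary, hexadecimal)
```

For the ASCII-only input "TB", every charset in the list should give the same two bytes (54 42), matching the tail of each row in the table. Differences appear only for non-ASCII input.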