To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ??????B | 00111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f42 |
| SJIS-WIN | 短造測短俗損B | 10010010010110101001000110100010100100011010101010010010010110101001000110101101100100011011100101000010 | 925a91a291aa925a91ad91b942 |
| EUC-JP | 短造測短俗損B | 11000011101110111100001010100100110000101010110011000011101110111100001010101111110000101011101101000010 | c3bbc2a4c2acc3bbc2afc2bb42 |
| UTF-8 | 短造測短俗損B | 11100111100111111010110111101001100000001010000011100110101110001010110011100111100111111010110111100100101111111001011111100110100100001000110101000010 | e79fade980a0e6b8ace79fade4bf97e6908d42 |
| UHC | 短造測短俗損B | 11010011101011011111000011100011111101101011010011010011101011011110000111010100111000011101111101000010 | d3adf0e3f6b4d3ade1d4e1df42 |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
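
The conversion shown above can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function names `to_bit_strings` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (in particular SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?", which matches the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels used above to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("短造測短俗損B").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column. Results for legacy charsets can differ slightly between implementations, so treat the output as a cross-check rather than a reference.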