To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 醇袒善醇袒善B 10001111100001101110010111010110100100010101000010001111100001101110010111010110100100010101000001000010 8f86e5d691508f86e5d6915042
EUC-JP 醇袒善醇袒善B 10111101111001101110101011011000110000011011000110111101111001101110101011011000110000011011000101000010 bde6ead8c1b1bde6ead8c1b142
UTF-8 醇袒善醇袒善B 11101001100001101000011111101000101000101001001011100101100101101000010011101001100001101000011111101000101000101001001011100101100101101000010001000010 e98687e8a292e59684e98687e8a292e5968442
UHC 醇袒善醇袒善B 11100010111101011101001110110010111000001011110011100010111101011101001110110010111000001011110001000010 e2f5d3b2e0bce2f5d3b2e0bc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)