Which bit string does a character (or string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN リ「跖章イリ「菽B 1101100010100010111001101110100010001111110011011011001011011000101000101110010011000001111100111010010001000010 d8a2e6e88fcdb2d8a2e4c1f3a442
EUC-JP リ「跖章イリ「菽?B 100011101101100010001110101000101110110011101010101111101100111110001110101100101000111011011000100011101010001011101000110000110011111101000010 8ed88ea2eceabecf8eb28ed88ea2e8c33f42
UTF-8 リ「跖章イリ「菽B 11101111101111101001100011101111101111011010001011101000101101111001011011100111101010111010000011101111101111011011001011101111101111101001100011101111101111011010001011101000100011111011110111101110100010101001011101000010 efbe98efbda2e8b796e7aba0efbdb2efbe98efbda2e88fbdee8a9742
UHC ???章???菽?B 001111110011111100111111111011011111000100111111001111110011111111100010110111010011111101000010 3f3f3fedf13f3f3fe2dd3f42
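
The rows above can be approximated offline with a short Python sketch. This is not the tool's own implementation, which is not shown here. The codec names chosen for each label (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions made for illustration. Python's mapping tables may differ from the tool's for some characters, for example user-defined or private-use code points.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Characters the charset cannot represent are replaced with "?" (0x3f),
    # which mirrors the 3f bytes visible in the table.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("章B").items():
        print(label, bits, hexa)
```

Under these assumptions, `encode_row("章", "utf-8")` yields the hex string `e7aba0` and `encode_row("章", "cp932")` yields `8fcd`, matching the byte sequences for that character in the UTF-8 and SJIS-WIN rows.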

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).