To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 怏?┨腰??腰?┨B 100111001000100100111111100001001011011110001101100110000011111100111111100011011001100000111111100001001011011101000010 9c893f84b78d983f3f8d983f84b742
EUC-JP 怏?┨腰??腰?┨B 110101111110100100111111101010001011100110111001111110000011111100111111101110011111100000111111101010001011100101000010 d7e93fa8b9b9f83f3fb9f83fa8b942
UTF-8 怏좈┨腰밧뒥腰백┨B 11100110100000001000111111101100101000101000100011100010100101001010100011101000100001011011000011101011101100001010011111101011100100101010010111101000100001011011000011101011101100001011000111100010100101001010100001000010 e6808feca288e294a8e885b0ebb0a7eb92a5e885b0ebb0b1e294a842
UHC 怏좈┨腰밧뒥腰백┨B 11100100111010001010000011101001101001101011100111101001101001101011100111100101100010101010000011101001101001101011100111101001101001101011100101000010 e4e8a0e9a6b9e9a6b9e58aa0e9a6b9e9a6b942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)