To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 炭促其箪達続B 10010010010110011001000110100011100100011011010010010010010111001001001001000010100100011011000101000010 925991a391b4925c924291b142
EUC-JP 炭促其箪達続B 11000011101110101100001010100101110000101011011011000011101111011100001110100011110000101011001101000010 c3bac2a5c2b6c3bdc3a3c2b342
UTF-8 炭促其箪達続B 11100111100000101010110111100100101111111000001111100101100001011011011011100111101011101010101011101001100000011001010011100111101101101001101001000010 e782ade4bf83e585b6e7aeaae98194e7b69a42
UHC 炭促其?達?B 1111011110101001111101011011010111010000111011000011111111010011101110010011111101000010 f7a9f5b5d0ec3fd3b93f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)