What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????\ | 00111111001111110011111100111111001111110011111101011100 | 3f3f3f3f3f3f5c
SJIS-WIN | ?舌夭?舌?\ | 00111111100100001110001110011010111011100011111110010000111000110011111101011100 | 3f90e39aee3f90e33f5c
EUC-JP | 炤舌夭炤舌?\ | 1000111111001001110100101100000011100101110101001111000010001111110010011101001011000000111001010011111101011100 | 8fc9d2c0e5d4f08fc9d2c0e53f5c
UTF-8 | 炤舌夭炤舌섍\ | 11100111100000101010010011101000100010001000110011100101101001001010110111100111100000101010010011101000100010001000110011101100100001001000110101011100 | e782a4e8888ce5a4ade782a4e8888cec848d5c
UHC | 炤舌夭炤舌섍\ | 11100001101111111110000011011111111010001110110011100001101111111110000011011111100110001110101001011100 | e1bfe0dfe8ece1bfe0df98ea5c

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
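
The same kind of conversion can be reproduced offline. The following is a minimal Python sketch, not this page's own implementation. It assumes that Python's codec names `latin-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC rows above; the mapping tables may differ for some characters. The function name `to_bit_strings` and the sample text are chosen here for illustration. Characters a charset cannot represent are replaced with "?" (hex 3f).

```python
# Hypothetical helper: encode text in a charset and return its bit strings.
def to_bit_strings(text, codec):
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "latin-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]

if __name__ == "__main__":
    sample = "炤舌夭炤舌섍\\"  # sample text taken from the UTF-8 row
    for label, codec in CHARSETS:
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal one.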