Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??踰??節??汚 1110000110011111001111110011111111100110111110100011111100111111100100001101111100111111001111111000100110011000 e19f3f3fe6fa3f3f90df3f3f8998
EUC-JP 癲??踰??節??汚 1110001010100001001111110011111111101100111111000011111100111111110000001110000100111111001111111011000111111000 e2a13f3fecfc3f3fc0e13f3fb1f8
UTF-8 癲ㅻ슢踰좑쭫節됰눛汚 111001111001100110110010111000111000010110111011111011001000101010100010111010001011100010110000111011001010001010010001111011001010110110101011111001111010111110000000111010111001000010110000111010111000100010011011111001101011000110011010 e799b2e385bbec8aa2e8b8b0eca291ecadabe7af80eb90b0eb889be6b19a
UHC 癲ㅻ슢踰좑쭫節됰눛汚 1110111110100110101001001110101110011010101011101110101110110010101000001110111110100111100111111110111110111101100010011110101110000111101100111110011111111101 efa6a4eb9aaeebb2a0efa79fefbd89eb87b3e7fd

SJIS-Win, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
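
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation, which is not shown here. The codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumed to be the closest Python equivalents of the charsets listed above, and the helper names encode_row and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f runs in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
# These are close equivalents, not guaranteed to match the page byte for byte.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

Keeping the label-to-codec mapping in one dictionary makes it easy to add or swap a charset if a different codec turns out to match the page's output more closely.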