To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 箋??癲??夜??鈺 1110001010110011001111110011111111100001100111110011111100111111100101101110100100111111001111111111101111000100 e2b33f3fe19f3f3f96e93f3ffbc4
EUC-JP 箋??癲??夜??鈺 111001001011010100111111001111111110001010100001001111110011111111001100111010110011111100111111100011111110001111010101 e4b53f3fe2a13f3fcceb3f3f8fe3d5
UTF-8 箋덃췃癲숋쫶夜껋닁鈺 111001111010111010001011111010111000110110000011111011001011011110000011111001111001100110110010111011001000100010001011111011001010101110110110111001011010010010011100111010101011101110001011111010111000101110000001111010011000100010111010 e7ae8beb8d83ecb783e799b2ec888becabb6e5a49ceabb8beb8b81e988ba
UHC 箋덃췃癲숋쫶夜껋닁鈺 1110111110101000100010001110011010101101100111111110111110100110100110011110111110100110100011011110010110101000100000111110110010001000100010101110100010101101 efa888e6ad9fefa699efa68de5a883ec888ae8ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)