To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 憶??鎰??伎泣 100010011010111100111111001111111110100001001100001111110011111110001010111010101000101110000011 89af3f3fe84c3f3f8aea8b83
EUC-JP 憶??鎰??伎泣 101100101011000100111111001111111110111110101101001111110011111110110100111011001011010111100011 b2b13f3fefad3f3fb4ecb5e3
UTF-8 憶귣쪋鎰꾦뇦伎泣 111001101000011010110110111010101011011110100011111011001010101010001011111010011000111010110000111010101011111010100110111010111000011110100110111001001011110010001110111001101011001110100011 e686b6eab7a3ecaa8be98eb0eabea6eb87a6e4bc8ee6b3a3
UHC 憶귣쪋鎰꾦뇦伎泣 11100101111000111000001011101011101001011000010111101100111100001000010011101001100001111000111011010000111010111110101111101000 e5e382eba585ecf084e9878ed0ebebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)