To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 娃??泣??臾??? 10001000101000010011111100111111100010111000001100111111001111111110010001101011001111110011111100111111 88a13f3f8b833f3fe46b3f3f3f
EUC-JP 娃??泣??臾??? 10110000101000110011111100111111101101011110001100111111001111111110011111001100001111110011111100111111 b0a33f3fb5e33f3fe7cc3f3f3f
UTF-8 娃숈궪泣먩뿿臾뺤돹黎 111001011010100010000011111011001000100010001000111010101011011010101010111001101011001110100011111010111010100010101001111010111011111110111111111010001000011110111110111010111011101010100100111010111000111110111001111011111010011010001001 e5a883ec8888eab6aae6b3a3eba8a9ebbfbfe887beebbaa4eb8fb9efa689
UHC 娃숈궪泣먩뿿臾뺤돹黎 1110100011011111100110011110110010000010101111001110101111101000100100001110011010010111101111111110101110101100100101011110110010001001101111001110011010110001 e8df99ec82bcebe890e697bfebac95ec89bce6b1

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
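
The conversion shown in the table can be approximated with a minimal Python sketch. It is not this page's actual implementation. The mapping from the table's charset labels to Python codec names (for example UHC to `cp949`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("娃").items():
        print(label, bits, hex_string)
```

For the single character 娃, this yields 88a1 for SJIS-WIN, b0a3 for EUC-JP, and e5a883 for UTF-8, matching the leading bytes of the corresponding rows in the table, while ISO-8859-1 falls back to 3f.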