What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ゅ?餓??言?? 10011010110010001000001011100011001111111000100111101100001111110011111110001100101111100011111100111111 9ac882e33f89ec3f3f8cbe3f3f
EUC-JP 塋ゅ?餓??言?? 11010100110010101010010011100101001111111011001011101110001111110011111110111000110000000011111100111111 d4caa4e53fb2ee3f3fb8c03f3f
UTF-8 塋ゅ끀餓뽬꽦言됪릫 111001011010000110001011111000111000001010000101111010111000000110000000111010011010010010010011111010111011110110101100111010101011110110100110111010001010100010000000111010111001000010101010111010111010011010101011 e5a18be38285eb8180e9a493ebbdaceabda6e8a880eb90aaeba6ab
UHC 塋ゅ끀餓뽬꽦言됪릫 111001111010101110101010111001011000010110110110111001001011101110010110111010001000010010110001111001011110101110001001111001101001000010001101 e7abaae585b6e4bb96e884b1e5eb89e6908d
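A table like the one above can be approximated with a few lines of Python. This is only a minimal sketch, not the page's actual implementation: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-win, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` seen in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# The page does not show its own implementation, so this is a guess.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("ゅ").items():
        print(label, binary, hexadecimal)
```

Passing `errors="replace"` is a design choice for this sketch: it keeps every row the same length in characters, at the cost of hiding which input characters were lost.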

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).