To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 疫??嵬??奧??外 1000100101110101001111110011111110011011110010100011111100111111100110101111101000111111001111111000101001001111 89753f3f9bca3f3f9afa3f3f8a4f
EUC-JP 疫??嵬??奧??外 1011000111010110001111110011111111010110110011000011111100111111110101001111110000111111001111111011001110110000 b1d63f3fd6cc3f3fd4fc3f3fb3b0
UTF-8 疫섓푾嵬뚪윻奧딃뀒外 111001111001011010101011111011001000010010010011111011011001000110111110111001011011010110101100111010111001101010101010111011001001110010111011111001011010010110100111111010111001010010000011111010111000000010010010111001011010010010010110 e796abec8493ed91bee5b5aceb9aaaec9cbbe5a5a7eb9483eb8092e5a496
UHC 疫섓푾嵬뚪윻奧딃뀒外 1110011010111001100110001110111110111110100010011110100011100011100011001110100110011111101101011110011111110011100010101110100110000101100011001110100011100010 e6b998efbe89e8e38ce99fb5e7f38ae9858ce8e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)