To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8倚??猷??閻??日??兢踰ε? 11100001100111110011111110000010010101111001100011011111001111110011111110010111010100010011111100111111111010001000010100111111001111111001001111111010001111110011111110011001010111011110011011111010100000111100001100111111 e19f3f825798df3f3f97513f3fe8853f3f93fa3f3f995de6fa83c33f
EUC-JP 癲?8倚??猷??閻??日??兢踰ε? 11100010101000010011111110100011101110001101000011100001001111110011111111001101101100100011111100111111111011111110010100111111001111111100011011111100001111110011111111010001101111101110110011111100101001101100010100111111 e2a13fa3b8d0e13f3fcdb23f3fefe53f3fc6fc3f3fd1beecfca6c53f
UTF-8 癲쒕8倚쒙㎟猷앺맩閻롫챿日뗩땡兢踰ε쳞 1110011110011001101100101110110010010010100101011110111110111100100110001110010110000000100110101110110010010010100110011110001110001110100111111110011110001100101101111110110010010101101110101110101110100111101010011110100110010110101110111110101110100001101010111110110010110001101111111110011010010111101001011110101110010111101010011110101110010101101000011110010110000101101000101110100010111000101100001100111010110101111011001011001110011110 e799b2ec9295efbc98e5809aec9299e38e9fe78cb7ec95baeba7a9e996bbeba1abecb1bfe697a5eb97a9eb95a1e585a2e8b8b0ceb5ecb39e
UHC 癲쒕8倚쒙㎟猷앺맩閻롫챿日뗩땡兢踰ε쳞 1110111110100110100111001110101110100011101110001110101111101111100111001110111110100111101100011110101110100011100111011110110110010000101100011110011110100010100011101110101110101010100011001110110011101101100010111110100110110110101011111101000011100111111010111011001010100101111001011010101110000100 efa69ceba3b8ebef9cefa7b1eba39ded90b1e7a28eebaa8ceced8be9b6afd0e7ebb2a5e5ab84

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)