To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 蠍晉垈蹂キ蠍晄逧キ[蠍晉垈蹂キ蠍晄逧キ[^ 111001011011011010011101111001111001101010110000111001101111100010110111111001011011011010011101111001101111100010110000111001111001101010110111010110111110010110110110100111011110011110011010101100001110011011111000101101111110010110110110100111011110011011111000101100001110011110011010101101110101101101011110 e5b69de79ab0e6f8b7e5b69de6f8b0e79ab75be5b69de79ab0e6f8b7e5b69de6f8b0e79ab75b5e
EUC-JP 蠍晉垈蹂キ蠍晄?逧キ[蠍晉垈蹂キ蠍晄?逧キ[^ 1110101010111000110110101110100111010100101100101110110011111010100011101011011111101010101110001101101011101000001111111110110111111010100011101011011101011011111010101011100011011010111010011101010010110010111011001111101010001110101101111110101010111000110110101110100000111111111011011111101010001110101101110101101101011110 eab8dae9d4b2ecfa8eb7eab8dae83fedfa8eb75beab8dae9d4b2ecfa8eb7eab8dae83fedfa8eb75b5e
UTF-8 蠍晉垈蹂キ蠍晄逧キ[蠍晉垈蹂キ蠍晄逧キ[^ 111010001010000010001101111001101001100110001001111001011001111010001000111010001011100110000010111011111011110110110111111010001010000010001101111001101001100110000100111011101001100110001111111010011000000010100111111011111011110110110111010110111110100010100000100011011110011010011001100010011110010110011110100010001110100010111001100000101110111110111101101101111110100010100000100011011110011010011001100001001110111010011001100011111110100110000000101001111110111110111101101101110101101101011110 e8a08de69989e59e88e8b982efbdb7e8a08de69984ee998fe980a7efbdb75be8a08de69989e59e88e8b982efbdb7e8a08de69984ee998fe980a7efbdb75b5e
UHC ?晉垈蹂??晄???[?晉垈蹂??晄???[^ 00111111111100101100101111010011110111001110101110110011001111110011111111111100110011010011111100111111001111110101101100111111111100101100101111010011110111001110101110110011001111110011111111111100110011010011111100111111001111110101101101011110 3ff2cbd3dcebb33f3ffccd3f3f3f5b3ff2cbd3dcebb33f3ffccd3f3f3f5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)