Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 歪??艤??揄??[歪??艤??揄??[^ 100110000110001100111111001111111110010001111110001111110011111110011101100010010011111100111111010110111001100001100011001111110011111111100100011111100011111100111111100111011000100100111111001111110101101101011110 98633f3fe47e3f3f9d893f3f5b98633f3fe47e3f3f9d893f3f5b5e
EUC-JP 歪??艤??揄??[歪??艤??揄??[^ 110011111100010000111111001111111110011111011111001111110011111111011001111010010011111100111111010110111100111111000100001111110011111111100111110111110011111100111111110110011110100100111111001111110101101101011110 cfc43f3fe7df3f3fd9e93f3f5bcfc43f3fe7df3f3fd9e93f3f5b5e
UTF-8 歪뺣벩艤욥굲揄우쑠[歪뺣벩艤욥굲揄우쑠[^ 111001101010110110101010111010111011101010100011111010111011001010101001111010001000100110100100111011001001101010100101111010101011010110110010111001101000111110000100111011001001101010110000111011001001000110100000010110111110011010101101101010101110101110111010101000111110101110110010101010011110100010001001101001001110110010011010101001011110101010110101101100101110011010001111100001001110110010011010101100001110110010010001101000000101101101011110 e6adaaebbaa3ebb2a9e889a4ec9aa5eab5b2e68f84ec9ab0ec91a05be6adaaebbaa3ebb2a9e889a4ec9aa5eab5b2e68f84ec9ab0ec91a05b5e
UHC 歪뺣벩艤욥굲揄우쑠[歪뺣벩艤욥굲揄우쑠[^ 111010001110000010010101111010111001001110111111111010111111101010111111111010011000001010010101111010101111000110111111111011001001110010111111010110111110100011100000100101011110101110010011101111111110101111111010101111111110100110000010100101011110101011110001101111111110110010011100101111110101101101011110 e8e095eb93bfebfabfe98295eaf1bfec9cbf5be8e095eb93bfebfabfe98295eaf1bfec9cbf5b5e

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
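
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The codec names below are Python's, and mapping them to the table's labels (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. The helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


# Print one line per charset: label, input, binary, hexadecimal.
sample = "歪[^"
for label, (binary, hexadecimal) in encode_table(sample).items():
    print(label, sample, binary, hexadecimal)
```

For the first character of the table's input, 歪, this yields e6adaa in UTF-8, 9863 in SJIS-WIN, cfc4 in EUC-JP, and 3f in ISO-8859-1, matching the leading bytes of the corresponding rows above.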