To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 臟堪????m雀宏臟堪????m雀槐^ 1110010001100110100010101010110000111111001111110011111100111111100000101000110110010000100111011000110101000111111001000110011010001010101011000011111100111111001111110011111110000010100011011001000010011101100111101100010101011110 e4668aac3f3f3f3f828d909d8d47e4668aac3f3f3f3f828d909d9ec55e
EUC-JP 臟堪????m雀宏臟堪????m雀槐^ 1110011111000111101101001010111000111111001111110011111100111111101000111110110110111111111111011011100110101000111001111100011110110100101011100011111100111111001111110011111110100011111011011011111111111101110111001100011101011110 e7c7b4ae3f3f3f3fa3edbffdb9a8e7c7b4ae3f3f3f3fa3edbffddcc75e
UTF-8 臟堪렓렡뤋퀚m雀宏臟堪렓렡뤋퀚m雀槐^ 11101000100001111001111111100101101000001010101011101011101000001001001111101011101000001010000111101011101001001000101111101101100000001001101011101111101111011000110111101001100110111000000011100101101011101000111111101000100001111001111111100101101000001010101011101011101000001001001111101011101000001010000111101011101001001000101111101101100000001001101011101111101111011000110111101001100110111000000011100110101001111001000001011110 e8879fe5a0aaeba093eba0a1eba48bed809aefbd8de99b80e5ae8fe8879fe5a0aaeba093eba0a1eba48bed809aefbd8de99b80e6a7905e
UHC 臟堪렓렡뤋퀚m雀宏臟堪렓렡뤋퀚m雀槐^ 11101101111101001100101011101101100011101010100010001110101100101000111110111011101100111000111010100011111011011110110111001101110011101101101111101101111101001100101011101101100011101010100010001110101100101000111110111011101100111000111010100011111011011110110111001101110011101101100101011110 edf4caed8ea88eb28fbbb38ea3ededcdcedbedf4caed8ea88eb28fbbb38ea3ededcdced95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)