To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????TE?????????TEB 0011111100111111001111110011111100111111001111110011111100111111001111110101010001000101001111110011111100111111001111110011111100111111001111110011111100111111010101000100010101000010 3f3f3f3f3f3f3f3f3f54453f3f3f3f3f3f3f3f3f544542
SJIS-WIN ??衛??爲??やTE??衛??爲??やTEB 0011111100111111100010010111000100111111001111111110000010101000001111110011111110000010111000100101010001000101001111110011111110001001011100010011111100111111111000001010100000111111001111111000001011100010010101000100010101000010 3f3f89713f3fe0a83f3f82e254453f3f89713f3fe0a83f3f82e2544542
EUC-JP ??衛??爲??やTE??衛??爲??やTEB 0011111100111111101100011101001000111111001111111110000010101010001111110011111110100100111001000101010001000101001111110011111110110001110100100011111100111111111000001010101000111111001111111010010011100100010101000100010101000010 3f3fb1d23f3fe0aa3f3fa4e454453f3fb1d23f3fe0aa3f3fa4e4544542
UTF-8 룵에衛룵에爲룵卽やTE룵에衛룵에爲룵卽やTEB 1110101110100011101101011110110010010111100100001110100010100001100110111110101110100011101101011110110010010111100100001110011110001000101100101110101110100011101101011110010110001101101111011110001110000010100001000101010001000101111010111010001110110101111011001001011110010000111010001010000110011011111010111010001110110101111011001001011110010000111001111000100010110010111010111010001110110101111001011000110110111101111000111000001010000100010101000100010101000010 eba3b5ec9790e8a19beba3b5ec9790e788b2eba3b5e58dbde382845445eba3b5ec9790e8a19beba3b5ec9790e788b2eba3b5e58dbde38284544542
UHC 룵에衛룵에爲룵卽やTE룵에衛룵에爲룵卽やTEB 1000111110101010101111111010000111101010110110111000111110101010101111111010000111101010110100111000111110101010111100011110110110101010111001000101010001000101100011111010101010111111101000011110101011011011100011111010101010111111101000011110101011010011100011111010101011110001111011011010101011100100010101000100010101000010 8faabfa1eadb8faabfa1ead38faaf1edaae454458faabfa1eadb8faabfa1ead38faaf1edaae4544542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)