To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???倚??醫??熬??源??揄щ?阿 0011111100111111001111111001100011011111001111110011111111100111110011100011111100111111111000001001001000111111001111111000110010111001001111110011111110011101100010011000010010001011001111111000100010100010 3f3f3f98df3f3fe7ce3f3fe0923f3f8cb93f3f9d89848b3f88a2
EUC-JP ???倚??醫??熬??源??揄щ?阿 0011111100111111001111111101000011100001001111110011111111101110110100000011111100111111110111111111001000111111001111111011100010111011001111110011111111011001111010011010011111101011001111111011000010100100 3f3f3fd0e13f3feed03f3fdff23f3fb8bb3f3fd9e9a7eb3fb0a4
UTF-8 嶺뚮벀倚닷퐲醫묆룋熬곥굦源뉒춯揄щ룊阿 1110111110100110101010111110101110011010101011101110101110110010100000001110010110000000100110101110101110001011101101111110110110010000101100101110100110000110101010111110101110101100100001101110101110100011100010111110011110000110101011001110101010110011101001011110101010110101101001101110011010111010100100001110101110001001100100101110110010110110101011111110011010001111100001001101000110001001111010111010001110001010111010011001100010111111 efa6abeb9aaeebb280e5809aeb8bb7ed90b2e986abebac86eba38be786aceab3a5eab5a6e6ba90eb8992ecb6afe68f84d189eba38ae998bf
UHC 嶺뚮벀倚닷퐲醫묆룋熬곥굦源뉒춯揄щ룊阿 1110011110101101100011001110101110010011101001101110101111101111101101001110010110111101100110111110110010100010100100011110001110001111100010101110100010100010100000011110001110000010100011001110101010111001100001111110011110101101100011001110101011110001101011001110101110001111100010011110010010111001 e7ad8ceb93a6ebefb4e5bd9beca291e38f8ae8a281e3828ceab987e7ad8ceaf1aceb8f89e4b9

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
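
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation. The codec names in `CHARSETS` (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumed equivalents, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table, but Python's mapping tables may differ from the converter's in edge cases.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec."""
    # errors="replace" substitutes '?' (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {charset label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("阿").items():
        print(label, bits, hex_string)
```

For the single character 阿, this sketch yields `e998bf` in UTF-8, `88a2` in CP932, and `b0a4` in EUC-JP, matching the final bytes of the corresponding rows in the table; ISO-8859-1 cannot represent the character and yields `3f`.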