What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??認??????レ?諭??音?シ筌 1110001010100011001111110011111110010100010001100011111100111111001111110011111100111111001111111000001110001100001111111001011101000000001111110011111110001001101110010011111110000011010101101110001010100011 e2a33f3f94463f3f3f3f3f3f838c3f97403f3f89b93f8356e2a3
EUC-JP 筌??認??洧???レ?諭??音?シ筌 11100100101001010011111100111111110001111010011100111111001111111000111111000111101101000011111100111111001111111010010111101100001111111100110110100001001111110011111110110010101110110011111110100101101101111110010010100101 e4a53f3fc7a73f3f8fc7b43f3f3fa5ec3fcda13f3fb2bb3fa5b7e4a5
UTF-8 筌뚮뱷認뗰ℓ洧댁뿉曆レ쥉諭좑쭓音섏シ筌 111001111010110110001100111010111001101010101110111010111011000110110111111010001010101010001101111010111001011110110000111000101000010010010011111001101011010010100111111010111000110010000001111010111011111110001001111011111010011010001011111000111000001110101100111011001010010110001001111010001010101110101101111011001010001010010001111011001010110110010011111010011001111110110011111011001000010010001111111000111000001010110111111001111010110110001100 e7ad8ceb9aaeebb1b7e8aa8deb97b0e28493e6b4a7eb8c81ebbf89efa68be383aceca589e8abadeca291ecad93e99fb3ec848fe382b7e7ad8c
UHC 筌뚮뱷認뗰ℓ洧댁뿉曆レ쥉諭좑쭓音섏シ筌 1110111110100111100011001110101110010011100111011110110011100011100010111110111110100111101001001110101011111011101101001110110010010111100100001110011010110111101010111110110010100010100000101110101110110001101000001110111110100111100010111110101111100101100110001110110010101011101101111110111110100111 efa78ceb939dece38befa7a4eafbb4ec9790e6b7abeca282ebb1a0efa78bebe598ecabb7efa7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
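
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that Python's codec names cp932, euc_jp and cp949 are close stand-ins for SJIS-WIN, EUC-JP and UHC, and the function name encode_report is made up for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the runs of 3f in the table above.

```python
# Hypothetical helper: encode one string in several charsets and return
# the resulting bytes as a binary string and a hexadecimal string.
# The codec names below are Python's; treating them as equivalents of the
# page's SJIS-WIN, EUC-JP and UHC labels is an assumption.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=CHARSETS):
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_report("あ"):
        print(name, bits, hexstr)
```

Results from a different library may differ from the table for characters where vendor mappings disagree, so treat the output as a cross-check and not as a reference.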