To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????醫??域?????????碎??域 001111110011111100111111001111110011111100111111111001111100111000111111001111111000100011100110001111110011111100111111001111110011111100111111001111110011111100111111111000011110101000111111001111111000100011100110 3f3f3f3f3f3fe7ce3f3f88e63f3f3f3f3f3f3f3f3fe1ea3f3f88e6
EUC-JP ??????醫??域?????????碎??域 001111110011111100111111001111110011111100111111111011101101000000111111001111111011000011101000001111110011111100111111001111110011111100111111001111110011111100111111111000101110110000111111001111111011000011101000 3f3f3f3f3f3feed03f3fb0e83f3f3f3f3f3f3f3f3fe2ec3f3fb0e8
UTF-8 嶺뚮벀流ⓨ퐲醫묆룋域민노쟽嶺뚮벀流⒳튃碎ⓦ룋域 111011111010011010101011111010111001101010101110111010111011001010000000111011111010011110001010111000101001001110101000111011011001000010110010111010011000011010101011111010111010110010000110111010111010001110001011111001011001111110011111111010111010111110111100111010111000010110111000111011001001111110111101111011111010011010101011111010111001101010101110111010111011001010000000111011111010011110001010111000101001001010110011111011011000101010000011111001111010001010001110111000101001001110100110111010111010001110001011111001011001111110011111 efa6abeb9aaeebb280efa78ae293a8ed90b2e986abebac86eba38be59f9febafbceb85b8ec9fbdefa6abeb9aaeebb280efa78ae292b3ed8a83e7a28ee293a6eba38be59f9f
UHC 嶺뚮벀流ⓨ퐲醫묆룋域민노쟽嶺뚮벀流⒳튃碎ⓦ룋域 11100111101011011000110011101011100100111010011011101010111111001010100011100101101111011001101111101100101000101001000111100011100011111000101011100110101101001011100111001110101100111110101110100000100000111110011110101101100011001110101110010011101001101110101011111100101010011110010010111001100110011110000111101111101010001110001110001111100010101110011010110100 e7ad8ceb93a6eafca8e5bd9beca291e38f8ae6b4b9ceb3eba083e7ad8ceb93a6eafca9e4b999e1efa8e38f8ae6b4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)