Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???援???????????援????????B 001111110011111100111111100010011000011100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000100110000111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f89873f3f3f3f3f3f3f3f3f3f3f89873f3f3f3f3f3f3f3f42
EUC-JP ???援???????????援????????B 001111110011111100111111101100011110011100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011000111100111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3fb1e73f3f3f3f3f3f3f3f3f3f3fb1e73f3f3f3f3f3f3f3f42
UTF-8 嶺뚮뱪援㎪끽戮녈럷烈쀪탻嶺뚮뱪援㎪끽戮녈럷烈쀪탻B 11101111101001101010101111101011100110101010111011101011101100011010101011100110100011111011010011100011100011101010101011101011100000011011110111101111101001111001001011101011100001011000100011101011100111111011011111101111101001101001111111101100100000001010101011101101100000111011101111101111101001101010101111101011100110101010111011101011101100011010101011100110100011111011010011100011100011101010101011101011100000011011110111101111101001111001001011101011100001011000100011101011100111111011011111101111101001101001111111101100100000001010101011101101100000111011101101000010 efa6abeb9aaeebb1aae68fb4e38eaaeb81bdefa792eb8588eb9fb7efa69fec80aaed83bbefa6abeb9aaeebb1aae68fb4e38eaaeb81bdefa792eb8588eb9fb7efa69fec80aaed83bb42
UHC 嶺뚮뱪援㎪끽戮녈럷烈쀪탻嶺뚮뱪援㎪끽戮녈럷烈쀪탻B 11100111101011011000110011101011100100111001000011101010101101011010011111100110101100111010001111101011101111011011001111100011100011101001011011100110111011111001011111101010101101011001011111100111101011011000110011101011100100111001000011101010101101011010011111100110101100111010001111101011101111011011001111100011100011101001011011100110111011111001011111101010101101011001011101000010 e7ad8ceb9390eab5a7e6b3a3ebbdb3e38e96e6ef97eab597e7ad8ceb9390eab5a7e6b3a3ebbdb3e38e96e6ef97eab59742

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
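
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's own implementation: the names `CODECS`, `encode_row`, and `encode_table` are hypothetical, and it assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs are close enough stand-ins for the SJIS-Win, EUC-JP, and UHC rows. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These pairings are an illustration; edge-case mappings may differ.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("援B").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.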