To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蚓?????將????蚓?????將????^ 111001010110110100111111001111110011111100111111001111111001101110010010001111110011111100111111001111111110010101101101001111110011111100111111001111110011111110011011100100100011111100111111001111110011111101011110 e56d3f3f3f3f3f9b923f3f3f3fe56d3f3f3f3f3f9b923f3f3f3f5e
EUC-JP 蚓?????將????蚓?????將????^ 111010011100111000111111001111110011111100111111001111111101010111110010001111110011111100111111001111111110100111001110001111110011111100111111001111110011111111010101111100100011111100111111001111110011111101011110 e9ce3f3f3f3f3fd5f23f3f3f3fe9ce3f3f3f3f3fd5f23f3f3f3f5e
UTF-8 蚓쇳렖곡렧렧將쇳렖겼㉢蚓쇳렖곡렧렧將쇳렖겼㉢^ 11101000100110101001001111101100100001111011001111101011101000001001011011101010101100111010000111101011101000001010011111101011101000001010011111100101101100001000011111101100100001111011001111101011101000001001011011101010101100101011110011100011100010011010001011101000100110101001001111101100100001111011001111101011101000001001011011101010101100111010000111101011101000001010011111101011101000001010011111100101101100001000011111101100100001111011001111101011101000001001011011101010101100101011110011100011100010011010001001011110 e89a93ec87b3eba096eab3a1eba0a7eba0a7e5b087ec87b3eba096eab2bce389a2e89a93ec87b3eba096eab3a1eba0a7eba0a7e5b087ec87b3eba096eab2bce389a25e
UHC 蚓쇳렖곡렧렧將쇳렖겼㉢蚓쇳렖곡렧렧將쇳렖겼㉢^ 111011001110001010111100111011011000111010101011101100001110111010001110101101101000111010110110111011011110001010111100111011011000111010101011101100001110010110101000101100111110110011100010101111001110110110001110101010111011000011101110100011101011011010001110101101101110110111100010101111001110110110001110101010111011000011100101101010001011001101011110 ece2bced8eabb0ee8eb68eb6ede2bced8eabb0e5a8b3ece2bced8eabb0ee8eb68eb6ede2bced8eabb0e5a8b35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)