To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 荳茨セ堤⊂訒ッ霎桀N}荳茨セ堤⊂訒ッ霎桀N{^ 11100100101110001000100011101111101111101001001011100111100000011011110011111011101000111010111111101000101111101001111001111011010011100111110111100100101110001000100011101111101111101001001011100111100000011011110011111011101000111010111111101000101111101001111001111011010011100111101101011110 e4b888efbe92e781bcfba3afe8be9e7b4e7de4b888efbe92e781bcfba3afe8be9e7b4e7b5e
EUC-JP 荳茨セ堤⊂訒ッ霎桀N}荳茨セ堤⊂訒ッ霎桀N{^ 11101000101110101011000011110001100011101011111011000100111010011010001010111110100011111101110111001000100011101010111111110000110000001101101111011100010011100111110111101000101110101011000011110001100011101011111011000100111010011010001010111110100011111101110111001000100011101010111111110000110000001101101111011100010011100111101101011110 e8bab0f18ebec4e9a2be8fddc88eaff0c0dbdc4e7de8bab0f18ebec4e9a2be8fddc88eaff0c0dbdc4e7b5e
UTF-8 荳茨セ堤⊂訒ッ霎桀N}荳茨セ堤⊂訒ッ霎桀N{^ 1110100010001101101100111110100010001100101010001110111110111101101111101110010110100000101001001110001010001010100000101110100010101000100100101110111110111101101011111110100110011100100011101110011010100001100000000100111001111101111010001000110110110011111010001000110010101000111011111011110110111110111001011010000010100100111000101000101010000010111010001010100010010010111011111011110110101111111010011001110010001110111001101010000110000000010011100111101101011110 e88db3e88ca8efbdbee5a0a4e28a82e8a892efbdafe99c8ee6a1804e7de88db3e88ca8efbdbee5a0a4e28a82e8a892efbdafe99c8ee6a1804e7b5e
UHC 荳茨?堤⊂???桀N}荳茨?堤⊂???桀N{^ 110101001110010111101101101111000011111111110000101001111010000111111000001111110011111100111111110010111111101001001110011111011101010011100101111011011011110000111111111100001010011110100001111110000011111100111111001111111100101111111010010011100111101101011110 d4e5edbc3ff0a7a1f83f3f3fcbfa4e7dd4e5edbc3ff0a7a1f83f3f3fcbfa4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)