What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 橈??寥?????絶??絶??節?9節??^ 1001111011110100001111110011111110011011100011000011111100111111001111110011111100111111100100001110001000111111001111111001000011100010001111110011111110010000110111110011111110000010010110001001000011011111001111110011111101011110 9ef43f3f9b8c3f3f3f3f3f90e23f3f90e23f3f90df3f825890df3f3f5e
EUC-JP 橈??寥??獒??絶??絶??節?9節??^ 11011100111101100011111100111111110101011110110000111111001111111000111111001011101110110011111100111111110000001110010000111111001111111100000011100100001111110011111111000000111000010011111110100011101110011100000011100001001111110011111101011110 dcf63f3fd5ec3f3f8fcbbb3f3fc0e43f3fc0e43f3fc0e13fa3b9c0e13f3f5e
UTF-8 橈놅풛寥낉쉽獒뱄슁絶욇쐠絶욇몗節김9節놂풙^ 11100110101010011000100011101011100001101000010111101101100100101001101111100101101011111010010111101011100000101000100111101100100010011011110111100111100011011001001011101011101100011000010011101100100010101000000111100111101101011011011011101100100110101000011111101100100100001010000011100111101101011011011011101100100110101000011111101011101010101001011111100111101011111000000011101010101110011000000011101111101111001001100111100111101011111000000011101011100001101000001011101101100100101001100101011110 e6a988eb8685ed929be5afa5eb8289ec89bde78d92ebb184ec8a81e7b5b6ec9a87ec90a0e7b5b6ec9a87ebaa97e7af80eab980efbc99e7af80eb8682ed92995e
UHC 橈놅풛寥낉쉽獒뱄슁絶욇쐠絶욇몗節김9節놂풙^ 11101000111110101000011011101111101111101001111011101000111011111000010111101111101111011011000111101000101000111011100111101111101111011011001111101111101111101001111011101001100111001000011011101111101111101001111011101001100100011000010111101111101111011011000111101000101000111011100111101111101111011011001111101111101111101001110001011110 e8fa86efbe9ee8ef85efbdb1e8a3b9efbdb3efbe9ee99c86efbe9ee99185efbdb1e8a3b9efbdb3efbe9c5e

SJIS-win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
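
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper names encode_row and encode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (3f), which is assumed to correspond to the question marks in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for label, (binary, hexadecimal) in encode_table("^").items():
        print(label, "^", binary, hexadecimal)
```

Running the script with "^" as input prints one row per charset. Passing a different string to encode_table produces rows in the same layout, although the results for unmappable characters may differ from this page's converter.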