To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???橈??松??節??節??與??橈??^ 00111111001111110011111110011110111101000011111100111111100011111011110000111111001111111001000011011111001111110011111110010000110111110011111100111111111001000110111100111111001111111001111011110100001111110011111101011110 3f3f3f9ef43f3f8fbc3f3f90df3f3f90df3f3fe46f3f3f9ef43f3f5e
EUC-JP ???橈??松??節??節??與??橈??^ 00111111001111110011111111011100111101100011111100111111101111101011111000111111001111111100000011100001001111110011111111000000111000010011111100111111111001111101000000111111001111111101110011110110001111110011111101011110 3f3f3fdcf63f3fbebe3f3fc0e13f3fc0e13f3fe7d03f3fdcf63f3f5e
UTF-8 遼놅슛橈롦㉥松잓쐠節면쵄節면쐩與뜯닟橈놅슛^ 11101111101001111000001111101011100001101000010111101100100010101001101111100110101010011000100011101011101000011010011011100011100010011010010111100110100111011011111011101100100111101001001111101100100100001010000011100111101011111000000011101011101010011011010011101100101101011000010011100111101011111000000011101011101010011011010011101100100100001010100111101000100010001000011111101011100111001010111111101011100010111001111111100110101010011000100011101011100001101000010111101100100010101001101101011110 efa783eb8685ec8a9be6a988eba1a6e389a5e69dbeec9e93ec90a0e7af80eba9b4ecb584e7af80eba9b4ec90a9e88887eb9cafeb8b9fe6a988eb8685ec8a9b5e
UHC 遼놅슛橈롦㉥松잓쐠節면쵄節면쐩與뜯닟橈놅슛^ 11101001101011001000011011101111101111011011100011101000111110101000111011100110101010001011011011100001111001101001111111101001100111001000011011101111101111011011100011101001101011001000011011101111101111011011100011101001100111001000111011100110101010001011011011100010100010001001111111101000111110101000011011101111101111011011100001011110 e9ac86efbdb8e8fa8ee6a8b6e1e69fe99c86efbdb8e9ac86efbdb8e99c8ee6a8b6e2889fe8fa86efbdb85e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)