To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 典?旨???s砥?雋梨典?旨???s砥?雋悧^ 1001001101010100001111111000111001111100001111110011111100111111100000101001001110010011011101010011111111101000101100101001011110011100100100110101010000111111100011100111110000111111001111110011111110000010100100111001001101110101001111111110100010110010100111001010010001011110 93543f8e7c3f3f3f829393753fe8b2979c93543f8e7c3f3f3f829393753fe8b29ca45e
EUC-JP 典?旨???s砥?雋梨典?旨???s砥?雋悧^ 1100010110110101001111111011101111011101001111110011111100111111101000111111001111000101110101100011111111110000101101001100110111111100110001011011010100111111101110111101110100111111001111110011111110100011111100111100010111010110001111111111000010110100110110001010011001011110 c5b53fbbdd3f3f3fa3f3c5d63ff0b4cdfcc5b53fbbdd3f3f3fa3f3c5d63ff0b4d8a65e
UTF-8 典렗旨곈렦ㅺs砥렫雋梨典렗旨곈렦ㅺs砥렫雋悧^ 11100101100001011011100011101011101000001001011111100110100101111010100011101010101100111000100011101011101000001010011011100011100001011011101011101111101111011001001111100111101000001010010111101011101000001010101111101001100110111000101111100110101000101010100011100101100001011011100011101011101000001001011111100110100101111010100011101010101100111000100011101011101000001010011011100011100001011011101011101111101111011001001111100111101000001010010111101011101000001010101111101001100110111000101111100110100000101010011101011110 e585b8eba097e697a8eab388eba0a6e385baefbd93e7a0a5eba0abe99b8be6a2a8e585b8eba097e697a8eab388eba0a6e385baefbd93e7a0a5eba0abe99b8be682a75e
UHC 典렗旨곈렦ㅺs砥렫雋梨典렗旨곈렦ㅺs砥렫雋悧^ 111011101111000010001110101011001111001010101001101100001110100110001110101101011010010011101010101000111111001111110010101100101000111010111001111100011110011011010111110111101110111011110000100011101010110011110010101010011011000011101001100011101011010110100100111010101010001111110011111100101011001010001110101110011111000111100110110101111101110001011110 eef08eacf2a9b0e98eb5a4eaa3f3f2b28eb9f1e6d7deeef08eacf2a9b0e98eb5a4eaa3f3f2b28eb9f1e6d7dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)