To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???雲??舜???????雲??舜????^ 001111110011111100111111100010010101111100111111001111111000111101110111001111110011111100111111001111110011111100111111001111111000100101011111001111110011111110001111011101110011111100111111001111110011111101011110 3f3f3f895f3f3f8f773f3f3f3f3f3f3f895f3f3f8f773f3f3f3f5e
EUC-JP ???雲??舜???????雲??舜????^ 001111110011111100111111101100011100000000111111001111111011110111011000001111110011111100111111001111110011111100111111001111111011000111000000001111110011111110111101110110000011111100111111001111110011111101011110 3f3f3fb1c03f3fbdd83f3f3f3f3f3f3fb1c03f3fbdd83f3f3f3f5e
UTF-8 쒔렜쒔雲렚쒔舜렋렣렋롆쒔렜쒔雲렚쒔舜렋렣렋롆^ 11101100100100101001010011101011101000001001110011101100100100101001010011101001100110111011001011101011101000001001101011101100100100101001010011101000100010001001110011101011101000001000101111101011101000001010001111101011101000001000101111101011101000011000011011101100100100101001010011101011101000001001110011101100100100101001010011101001100110111011001011101011101000001001101011101100100100101001010011101000100010001001110011101011101000001000101111101011101000001010001111101011101000001000101111101011101000011000011001011110 ec9294eba09cec9294e99bb2eba09aec9294e8889ceba08beba0a3eba08beba186ec9294eba09cec9294e99bb2eba09aec9294e8889ceba08beba0a3eba08beba1865e
UHC 쒔렜쒔雲렚쒔舜렋렣렋롆쒔렜쒔雲렚쒔舜렋렣렋롆^ 101111101010110110001110101011101011111010101101111010101010001110001110101011011011111010101101111000101110111110001110101000101000111010110100100011101010001010001110110011001011111010101101100011101010111010111110101011011110101010100011100011101010110110111110101011011110001011101111100011101010001010001110101101001000111010100010100011101100110001011110 bead8eaebeadeaa38eadbeade2ef8ea28eb48ea28eccbead8eaebeadeaa38eadbeade2ef8ea28eb48ea28ecc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)