To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????±?????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110110001001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fb13f3f3f3f3f5e
SJIS-WIN ?西????????佃□?潔?±?禿?藜感^ 001111111001000010111100001111110011111100111111001111110011111100111111001111110011111110010010110011111000000110100000001111111000110010001001001111111000000101111101001111111001001111000011001111111110010101011011100010101011010001011110 3f90bc3f3f3f3f3f3f3f3f92cf81a03f8c893f817d3f93c33fe55b8ab45e
EUC-JP ?西????????佃□?潔?±?禿?藜感^ 001111111100000010111110001111110011111100111111001111110011111100111111001111110011111111000100110100011010001010100010001111111011011111101001001111111010000111011110001111111100011011000101001111111110100110111100101101001011011001011110 3fc0be3f3f3f3f3f3f3f3fc4d1a2a23fb7e93fa1de3fc6c53fe9bcb4b65e
UTF-8 렊西롆쒔렊쒀롎쒔롐뤛佃□쨴潔춲±쵌禿춲藜感^ 111010111010000010001010111010001010010110111111111010111010000110000110111011001001001010010100111010111010000010001010111011001001001010000000111010111010000110001110111011001001001010010100111010111010000110010000111010111010010010011011111001001011110110000011111000101001011010100001111011001010100010110100111001101011110110010100111011001011011010110010110000101011000111101100101101011000110011100111101001101011111111101100101101101011001011101000100101111001110011100110100001001001111101011110 eba08ae8a5bfeba186ec9294eba08aec9280eba18eec9294eba190eba49be4bd83e296a1eca8b4e6bd94ecb6b2c2b1ecb58ce7a6bfecb6b2e8979ce6849f5e
UHC 렊西롆쒔렊쒀롎쒔롐뤛佃□쨴潔춲±쵌禿춲藜感^ 10001110101000011110000010100100100011101100110010111110101011011000111010100001101111101010110010001110110101001011111010101101100011101101011010001111110010101110111011101100101000011110000010100100100011101100110010111110101011011000111010100001101111101010110010001110110101001011111010101101100011101101010111101101110010101110111101011110 8ea1e0a48eccbead8ea1beac8ed4bead8ed68fcaeeeca1e0a48eccbead8ea1beac8ed4bead8ed5edcaef5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)