To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???H?????n}???H?????n{^ 0011111100111111001111110100100000111111001111110011111100111111001111110110111001111101001111110011111100111111010010000011111100111111001111110011111100111111011011100111101101011110 3f3f3f483f3f3f3f3f6e7d3f3f3f483f3f3f3f3f6e7b5e
SJIS-WIN ???H?蝴?咳?n}???H?蝴?咳?n{^ 001111110011111100111111010010000011111111100101100110100011111110001010010100000011111101101110011111010011111100111111001111110100100000111111111001011001101000111111100010100101000000111111011011100111101101011110 3f3f3f483fe59a3f8a503f6e7d3f3f3f483fe59a3f8a503f6e7b5e
EUC-JP ???H?蝴?咳?n}???H?蝴?咳?n{^ 001111110011111100111111010010000011111111101001111110100011111110110011101100010011111101101110011111010011111100111111001111110100100000111111111010011111101000111111101100111011000100111111011011100111101101011110 3f3f3f483fe9fa3fb3b13f6e7d3f3f3f483fe9fa3fb3b13f6e7b5e
UTF-8 뤱횓ㆋH렱蝴렑咳렱n}뤱횓ㆋH렱蝴렑咳렱n{^ 11101011101001001011000111101101100110101001001111100011100001101000101101001000111010111010000010110001111010001001110110110100111010111010000010010001111001011001001010110011111010111010000010110001011011100111110111101011101001001011000111101101100110101001001111100011100001101000101101001000111010111010000010110001111010001001110110110100111010111010000010010001111001011001001010110011111010111010000010110001011011100111101101011110 eba4b1ed9a93e3868b48eba0b1e89db4eba091e592b3eba0b16e7deba4b1ed9a93e3868b48eba0b1e89db4eba091e592b3eba0b16e7b5e
UHC 뤱횓ㆋH렱蝴렑咳렱n}뤱횓ㆋH렱蝴렑咳렱n{^ 100011111101111111000011100011101010010011111011010010001000111010111110111110111101110110001110101001101111101010100110100011101011111001101110011111011000111111011111110000111000111010100100111110110100100010001110101111101111101111011101100011101010011011111010101001101000111010111110011011100111101101011110 8fdfc38ea4fb488ebefbdd8ea6faa68ebe6e7d8fdfc38ea4fb488ebefbdd8ea6faa68ebe6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)