What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ????ぞ?紺叛?}????ぞ?紺叛?{^ 001111110011111100111111001111111000001010111100001111111000110110101110100101001011111000111111011111010011111100111111001111110011111110000010101111000011111110001101101011101001010010111110001111110111101101011110 3f3f3f3f82bc3f8dae94be3f7d3f3f3f3f82bc3f8dae94be3f7b5e
EUC-JP ????ぞ?紺叛?}????ぞ?紺叛?{^ 001111110011111100111111001111111010010010111110001111111011101010110000110010001100000000111111011111010011111100111111001111110011111110100100101111100011111110111010101100001100100011000000001111110111101101011110 3f3f3f3fa4be3fbab0c8c03f7d3f3f3f3fa4be3fbab0c8c03f7b5e
UTF-8 쒔롌뤏쮱ぞ쳩紺叛롎}쒔롌뤏쮱ぞ쳩紺叛롎{^ 111011001001001010010100111010111010000110001100111010111010010010001111111011001010111010110001111000111000000110011110111011001011001110101001111001111011010010111010111001011000111110011011111010111010000110001110011111011110110010010010100101001110101110100001100011001110101110100100100011111110110010101110101100011110001110000001100111101110110010110011101010011110011110110100101110101110010110001111100110111110101110100001100011100111101101011110 ec9294eba18ceba48fecaeb1e3819eecb3a9e7b4bae58f9beba18e7dec9294eba18ceba48fecaeb1e3819eecb3a9e7b4bae58f9beba18e7b5e
UHC 쒔롌뤏쮱ぞ쳩紺叛롎}쒔롌뤏쮱ぞ쳩紺叛롎{^ 101111101010110110001110110100101000111110111111101010001000111010101010101111101010101110001110110010101111101011011010111001001000111011010100011111011011111010101101100011101101001010001111101111111010100010001110101010101011111010101011100011101100101011111010110110101110010010001110110101000111101101011110 bead8ed28fbfa88eaabeab8ecafadae48ed47dbead8ed28fbfa88eaabeab8ecafadae48ed47b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
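
A conversion like the one in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation. The mapping from the table's charset labels to Python codec names is an assumption, as is the `errors="replace"` policy: it substitutes "?" (0x3f) for characters the target charset cannot represent, which matches the 3f bytes in the table. The names `CHARSETS` and `encode_row` are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code, assumed to correspond to CP949
}


def encode_row(text, codec):
    """Return (characters as representable in codec, binary string, hex string)."""
    # Characters the codec cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return raw.decode(codec), bits, raw.hex()


if __name__ == "__main__":
    sample = "ぞ{^"
    for label, codec in CHARSETS.items():
        shown, bits, hexa = encode_row(sample, codec)
        print(label, shown, bits, hexa)
```

For example, `encode_row("ぞ", "cp932")` yields the hex string `82bc`, and `encode_row("ぞ", "utf-8")` yields `e3819e`, the same byte sequences that appear for that character in the SJIS-WIN and UTF-8 rows above.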