Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 渦?????????ヨ??g?厓?????^ 1000100101010001001111110011111100111111001111110011111100111111001111110011111100111111100000111000100000111111001111111000001010000111001111111111101010001101001111110011111100111111001111110011111101011110 89513f3f3f3f3f3f3f3f3f83883f3f82873ffa8d3f3f3f3f3f5e
EUC-JP 渦?????????ヨ??g?厓?????^ 101100011011001000111111001111110011111100111111001111110011111100111111001111110011111110100101111010000011111100111111101000111110011100111111100011111011010011000111001111110011111100111111001111110011111101011110 b1b23f3f3f3f3f3f3f3f3fa5e83f3fa3e73f8fb4c73f3f3f3f3f5e
UTF-8 渦쏅젫蓮쇔룢淋껆눥列ヨ풘溜g눥厓쏇쉩蓮쇗씟^ 11100110101110001010011011101100100011111000010111101100101000001010101111101111101001101001100111101100100001111001010011101011101000111010001011101111101001111011010111101010101110111000011011101011100010001010010111101111101001101001110011100011100000111010100011101101100100101001100011101111101001111000101111101111101111011000011111101011100010001010010111100101100011101001001111101100100011111000011111101100100010011010100111101111101001101001100111101100100001111001011111101100100101001001111101011110 e6b8a6ec8f85eca0abefa699ec8794eba3a2efa7b5eabb86eb88a5efa69ce383a8ed9298efa78befbd87eb88a5e58e93ec8f87ec89a9efa699ec8797ec949f5e
UHC 渦쏅젫蓮쇔룢淋껆눥列ヨ풘溜g눥厓쏇쉩蓮쇗씟^ 11101000101111101001101111101011101000001010001111100110111001011011110011100101100011111001101111101100111110001000001111100111100001111011110011100110111010101010101111101000101111101001101111101010111111101010001111100111100001111011110011100100111011011001101111101101100110101000001111100110111001011011110011100110100111011011001101011110 e8be9beba0a3e6e5bce58f9becf883e787bce6eaabe8be9beafea3e787bce4ed9bed9a83e6e5bce69db35e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
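
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper names (`CODECS`, `to_bit_strings`, `convert`) are hypothetical, and the mapping from the charset labels to Python codec names is an assumption: `cp932` is used for SJIS-WIN and `cp949` for UHC, and Python's tables may differ from this page's in a few vendor-specific characters. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("渦^").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("^", "utf-8")` returns `("01011110", "5e")`, which matches the final byte of every row in the table.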