To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鍮??蹂??阿??苑?????揶??隱 00111111001111110011111111101000010010100011111100111111111001101111100000111111001111111000100010100010001111110011111110001001100100010011111100111111001111110011111100111111100111011000100000111111001111111110100010101010 3f3f3fe84a3f3fe6f83f3f88a23f3f89913f3f3f3f3f9d883f3fe8aa
EUC-JP ???鍮??蹂??阿??苑?????揶??隱 00111111001111110011111111101111101010110011111100111111111011001111101000111111001111111011000010100100001111110011111110110001111100010011111100111111001111110011111100111111110110011110100000111111001111111111000010101100 3f3f3fefab3f3fecfa3f3fb0a43f3fb1f13f3f3f3f3fd9e83f3ff0ac
UTF-8 捻뀀맩鍮섓쭏蹂⑹춳阿숋퐣苑묈쪊硫몃옓揶쏅떻隱 111011111010011010100100111010111000000010000000111010111010011110101001111010011000110110101110111011001000010010010011111011001010110110001111111010001011100110000010111000101001000110111001111011001011011010110011111010011001100010111111111011001000100010001011111011011001000010100011111010001000101110010001111010111010110010001000111011001010101010001010111011111010011110001110111010111010101010000011111011001001100010010011111001101000111110110110111011001000111110000101111010111001011010111011111010011001101010110001 efa6a4eb8080eba7a9e98daeec8493ecad8fe8b982e291b9ecb6b3e998bfec888bed90a3e88b91ebac88ecaa8aefa78eebaa83ec9893e68fb6ec8f85eb96bbe99ab1
UHC 捻뀀맩鍮섓쭏蹂⑹춳阿숋퐣苑묈쪊硫몃옓揶쏅떻隱 1110011011110111101100101110101110010000101100011110101110111001100110001110111110100111100010001110101110110011101010011110110010101101100011111110010010111001100110011110111110111101100011001110101010111101100100011110010110100101100001001110101110101001101110001110101110011110100110011110010110101010100110111110101110110110101110111110101111011111 e6f7b2eb90b1ebb998efa788ebb3a9ecad8fe4b999efbd8ceabd91e5a584eba9b8eb9e99e5aa9bebb6bbebdf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)