What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝?????喩??獄??苑?????沃??姨 11101010010000000011111100111111001111110011111100111111100110100110011100111111001111111000110110010110001111110011111110001001100100010011111100111111001111110011111100111111100101111000000000111111001111111001101101001000 ea403f3f3f3f3f9a673f3f8d963f3f89913f3f3f3f3f97803f3f9b48
EUC-JP 鵝??瑗??喩??獄??苑?????沃??姨 111100111010000100111111001111111000111111001100110000000011111100111111110100111100100000111111001111111011100111110110001111110011111110110001111100010011111100111111001111110011111100111111110011011110000000111111001111111101010110101001 f3a13f3f8fccc03f3fd3c83f3fb9f63f3fb1f13f3f3f3f3fcde03f3fd5a9
UTF-8 鵝숈뮇瑗띌윀喩믩럞獄쎼룂苑묈쪊硫몃궖沃랃퐛姨 111010011011010110011101111011001000100010001000111010111010111010000111111001111001000110010111111010111001110110001100111011001001110010000000111001011001011010101001111010111010111110101001111010111001111110011110111001111000110110000100111011001000111010111100111010111010001110000010111010001000101110010001111010111010110010001000111011001010101010001010111011111010011110001110111010111010101010000011111010101011011010010110111001101011001010000011111010111001111010000011111011011001000010011011111001011010011110101000 e9b59dec8888ebae87e79197eb9d8cec9c80e596a9ebafa9eb9f9ee78d84ec8ebceba382e88b91ebac88ecaa8aefa78eebaa83eab696e6b283eb9e83ed909be5a7a8
UHC 鵝숈뮇瑗띌윀喩믩럞獄쎼룂苑묈쪊硫몃궖沃랃퐛姨 1110010010111101100110011110110010010010100101101110101010111100101101101110100110011111100010111110101011100111100100101110101110001110100000011110100010101011100110111110001110001111100000111110101010111101100100011110010110100101100001001110101110101001101110001110101110000010101010111110100010101010100011011110111110111101100001011110110010101001 e4bd99ec9296eabcb6e99f8beae792eb8e81e8ab9be38f83eabd91e5a584eba9b8eb82abe8aa8defbd85eca9

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
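
A conversion like the one in the table above can be sketched in Python. This is only an illustration, not the code behind this page. The codec names used here (cp932 for SJIS-Win, cp949 for UHC) and the helper name encode_report are assumptions, and Python's codecs may map a few characters differently from the converter used on this page. Characters that a charset cannot represent are replaced with "?" (byte 3f).

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return a list of (charset, binary string, hex string) for text."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("A"):
        print(label, bits, hex_string)
```

For an ASCII character such as "A", every charset listed gives the same single byte (41). The encodings diverge only for characters outside ASCII.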