What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 搖??柚??乙l?嵬??循??儒??娃 100111011000101000111111001111111001011101001101001111110011111110001001101100111000001010001100001111111001101111001010001111110011111110001111011110100011111100111111100011101111001000111111001111111000100010100001 9d8a3f3f974d3f3f89b3828c3f9bca3f3f8f7a3f3f8ef23f3f88a1
EUC-JP 搖??柚??乙l?嵬??循??儒??娃 110110011110101000111111001111111100110110101110001111110011111110110010101101011010001111101100001111111101011011001100001111110011111110111101110110110011111100111111101111001111010000111111001111111011000010100011 d9ea3f3fcdae3f3fb2b5a3ec3fd6cc3f3fbddb3f3fbcf43f3fb0a3
UTF-8 搖깅쵎柚삥쨫乙l젂嵬됯램循욜뫀儒몃뼨娃 111001101001000010010110111010101011100110000101111011001011010110001110111001101001111110011010111011001000001010100101111011001010100010101011111001001011100110011001111011111011110110001100111011001010000010000010111001011011010110101100111010111001000010101111111010111001111010101000111001011011111010101010111011001001101010011100111010111010101110000000111001011000010010010010111010111010101010000011111010111011110010101000111001011010100010000011 e69096eab985ecb58ee69f9aec82a5eca8abe4b999efbd8ceca082e5b5aceb90afeb9ea8e5beaaec9a9cebab80e58492ebaa83ebbca8e5a883
UHC 搖깅쵎柚삥쨫乙l젂嵬됯램循욜뫀儒몃뼨娃 1110100011110100101100011110101110101100100100001110101011110110101110111110011010100100100001011110101111100000101000111110110010100000100001101110100011100011100010011110101010110111101001011110001011100000101111111110011110010001101001001110101011100011101110001110101110010110101010111110100011011111 e8f4b1ebac90eaf6bbe6a485ebe0a3eca086e8e389eab7a5e2e0bfe791a4eae3b8eb96abe8df

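SJIS-Win, EUC-JP: legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively. Characters that a charset cannot represent appear as "?" (3f) in the table above.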
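
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The codec names below are an assumed mapping from the table's charset names to Python's standard codecs (for example, SJIS-WIN to `cp932` and UHC to `cp949`), and `encode_row` and `convert` are hypothetical helper names. Characters that a charset cannot represent are replaced with "?" (3f), which matches the behavior visible in the table.

```python
# Assumed mapping from the table's charset names to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text in one codec."""
    # errors="replace" substitutes b"?" for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("搖").items():
        print(name, binary, hexadecimal)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.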