What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????E?????????EB 001111110011111100111111001111110011111100111111001111110011111100111111010001010011111100111111001111110011111100111111001111110011111100111111001111110100010101000010 3f3f3f3f3f3f3f3f3f453f3f3f3f3f3f3f3f3f4542
SJIS-WIN セフ漆シホ丞ミシアEセフ漆シホ丞ミシアEB 10111110110011001000111010111101101111001100111010001111111001011101000010111100101100010100010110111110110011001000111010111101101111001100111010001111111001011101000010111100101100010100010101000010 becc8ebdbcce8fe5d0bcb145becc8ebdbcce8fe5d0bcb14542
EUC-JP セフ漆シホ丞ミシアEセフ漆シホ丞ミシアEB 100011101011111010001110110011001011110010111111100011101011110010001110110011101011111011100111100011101101000010001110101111001000111010110001010001011000111010111110100011101100110010111100101111111000111010111100100011101100111010111110111001111000111011010000100011101011110010001110101100010100010101000010 8ebe8eccbcbf8ebc8ecebee78ed08ebc8eb1458ebe8eccbcbf8ebc8ecebee78ed08ebc8eb14542
UTF-8 セフ漆シホ丞ミシアEセフ漆シホ丞ミシアEB 111011111011110110111110111011111011111010001100111001101011110010000110111011111011110110111100111011111011111010001110111001001011100010011110111011111011111010010000111011111011110110111100111011111011110110110001010001011110111110111101101111101110111110111110100011001110011010111100100001101110111110111101101111001110111110111110100011101110010010111000100111101110111110111110100100001110111110111101101111001110111110111101101100010100010101000010 efbdbeefbe8ce6bc86efbdbcefbe8ee4b89eefbe90efbdbcefbdb145efbdbeefbe8ce6bc86efbdbcefbe8ee4b89eefbe90efbdbcefbdb14542
UHC ??漆??丞???E??漆??丞???EB 00111111001111111111011011010100001111110011111111100011101010100011111100111111001111110100010100111111001111111111011011010100001111110011111111100011101010100011111100111111001111110100010101000010 3f3ff6d43f3fe3aa3f3f3f453f3ff6d43f3fe3aa3f3f3f4542

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
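
A conversion like the one in the table can be approximated in Python. This is only a minimal sketch, not the tool's actual implementation: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `latin-1` for ISO-8859-1) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # errors="replace" substitutes "?" (0x3f) for each unencodable character.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    # Decoding the bytes again shows what survives the round trip.
    return raw.decode(codec), binary, raw.hex()


def encode_table(text):
    """Build one row per charset label, in the order listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print only the label and hex column to keep the output ASCII-safe.
    for label, (_shown, _binary, hexa) in encode_table("漆EB").items():
        print(label, hexa)
```

For the input "漆EB", the UTF-8 row of this sketch yields the hex string e6bc864542, matching the e6bc86 ... 4542 byte groups that appear for the same characters in the table.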