What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset  Character  Bit string (binary)  Bit string (hexadecimal)
ISO-8859-1 ????????????????????QB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101000101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5142
SJIS-WIN ????????????????????QB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101000101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5142
EUC-JP ????????????????????QB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101000101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5142
UTF-8 셔샹셔섰셍섯셔섹셍섄셍롚렽섟셔섀셍섹셍샬QB 1110110010000101100101001110110010000011101110011110110010000101100101001110110010000100101100001110110010000101100011011110110010000100101011111110110010000101100101001110110010000100101110011110110010000101100011011110110010000100100001001110110010000101100011011110101110100001100110101110101110100000101111011110110010000100100111111110110010000101100101001110110010000100100000001110110010000101100011011110110010000100101110011110110010000101100011011110110010000011101011000101000101000010 ec8594ec83b9ec8594ec84b0ec858dec84afec8594ec84b9ec858dec8484ec858deba19aeba0bdec849fec8594ec8480ec858dec84b9ec858dec83ac5142
UHC 셔샹셔섰셍섯셔섹셍섄셍롚렽섟셔섀셍섹셍샬QB 101111001100010110111100101001111011110011000101101111001011100110111100110001001011110010111000101111001100010110111100101111011011110011000100101111001010100110111100110001001000111011011110100011101100010110111100101100001011110011000101101111001010100010111100110001001011110010111101101111001100010010111100101000110101000101000010 bcc5bca7bcc5bcb9bcc4bcb8bcc5bcbdbcc4bca9bcc48ede8ec5bcb0bcc5bca8bcc4bcbdbcc4bca35142

SJIS-Win, EUC-JP: Legacy character encodings mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
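
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption; the page's own converter may differ in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f runs in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset of the table."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("QB").items():
        print(label, bits, hex_string)
```

For pure ASCII input such as "QB", every charset listed here yields the same bytes (51 42), which is why all rows of the table end in 5142.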