To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????C?????????KB 001111110011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111001111110100101101000010 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f4b42
SJIS-WIN 浴??慂?????C浴??慂?????KB 10010111100000010011111100111111100111001100100000111111001111110011111100111111001111110100001110010111100000010011111100111111100111001100100000111111001111110011111100111111001111110100101101000010 97813f3f9cc83f3f3f3f3f4397813f3f9cc83f3f3f3f3f4b42
EUC-JP 浴??慂?????C浴??慂?????KB 11001101111000010011111100111111110110001100101000111111001111110011111100111111001111110100001111001101111000010011111100111111110110001100101000111111001111110011111100111111001111110100101101000010 cde13f3fd8ca3f3f3f3f3f43cde13f3fd8ca3f3f3f3f3f4b42
UTF-8 浴녺쐞慂뤹쑊念곲떵C浴녺쐞慂뤹쑊念곲떵KB 111001101011010110110100111010111000010110111010111011001001000010011110111001101000010110000010111010111010010010111001111011001001000110001010111011111010011010100011111010101011001110110010111010111001011010110101010000111110011010110101101101001110101110000101101110101110110010010000100111101110011010000101100000101110101110100100101110011110110010010001100010101110111110100110101000111110101010110011101100101110101110010110101101010100101101000010 e6b5b4eb85baec909ee68582eba4b9ec918aefa6a3eab3b2eb96b543e6b5b4eb85baec909ee68582eba4b9ec918aefa6a3eab3b2eb96b54b42
UHC 浴녺쐞慂뤹쑊念곲떵C浴녺쐞慂뤹쑊念곲떵KB 111010011011000110000110111001111001110010000100111010011011110110001111111001111001110010101001111001101111011010000001111010011011011010111010010000111110100110110001100001101110011110011100100001001110100110111101100011111110011110011100101010011110011011110110100000011110100110110110101110100100101101000010 e9b186e79c84e9bd8fe79ca9e6f681e9b6ba43e9b186e79c84e9bd8fe79ca9e6f681e9b6ba4b42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)