To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????TSB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010101000101001101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f545342
SJIS-WIN 夜??厓ц???ァ??????厓э?TSB 100101101110100100111111001111111111101010001101100001001000100000111111001111110011111110000011010000000011111100111111001111110011111100111111001111111111101010001101100001001000111100111111010101000101001101000010 96e93f3ffa8d84883f3f3f83403f3f3f3f3f3ffa8d848f3f545342
EUC-JP 夜??厓ц???ァ??????厓э?TSB 1100110011101011001111110011111110001111101101001100011110100111111010000011111100111111001111111010010110100001001111110011111100111111001111110011111100111111100011111011010011000111101001111110111100111111010101000101001101000010 cceb3f3f8fb4c7a7e83f3f3fa5a13f3f3f3f3f3f8fb4c7a7ef3f545342
UTF-8 夜쇽푵厓ц춶溫볡ァ溫뽳쉽若듸숲厓э푶TSB 11100101101001001001110011101100100001111011110111101101100100011011010111100101100011101001001111010001100001101110110010110110101101101110011010111010101010111110101110110011101000011110001110000010101000011110011010111010101010111110101110111101101100111110110010001001101111011110111110100101101101001110101110010011101110001110110010001000101100101110010110001110100100111101000110001101111011011001000110110110010101000101001101000010 e5a49cec87bded91b5e58e93d186ecb6b6e6baabebb3a1e382a1e6baabebbdb3ec89bdefa5b4eb93b8ec88b2e58e93d18ded91b6545342
UHC 夜쇽푵厓ц춶溫볡ァ溫뽳쉽若듸숲厓э푶TSB 111001011010100010111100111011111011111010000011111001001110110110101100111010001010110110010010111010001010111010010011111001111010101110100001111010001010111010010110111011111011110110110001111001011010111010110101111011111011110110100011111001001110110110101100111011111011111010000100010101000101001101000010 e5a8bcefbe83e4edace8ad92e8ae93e7aba1e8ae96efbdb1e5aeb5efbda3e4edacefbe84545342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)