To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??萸??矣?????油ゅ?循??額?? 10011000110110100011111100111111111001001100111000111111001111111110000111100001001111110011111100111111001111110011111110010110111110111000001011100011001111111000111101111010001111110011111110001010011110100011111100111111 98da3f3fe4ce3f3fe1e13f3f3f3f3f96fb82e33f8f7a3f3f8a7a3f3f
EUC-JP 俑??萸??矣?????油ゅ?循??額?? 11010000110111000011111100111111111010001101000000111111001111111110001011100011001111110011111100111111001111110011111111001100111111011010010011100101001111111011110111011011001111110011111110110011110110110011111100111111 d0dc3f3fe8d03f3fe2e33f3f3f3f3fccfda4e53fbddb3f3fb3db3f3f
UTF-8 俑앹늿萸썸뤃矣묒춳醴븐슦油ゅ춢循뗫걦額됰컙 111001001011111110010001111011001001010110111001111010111000101010111111111010001001000010111000111011001000110110111000111010111010010010000011111001111001111110100011111010111010110010010010111011001011011010110011111011111010011010110111111010111011100010010000111011001000101010100110111001101011001010111001111000111000001010000101111011001011011010100010111001011011111010101010111010111001011110101011111010101011000110100110111010011010000110001101111010111001000010110000111011001011101110011001 e4bf91ec95b9eb8abfe890b8ec8db8eba483e79fa3ebac92ecb6b3efa6b7ebb890ec8aa6e6b2b9e38285ecb6a2e5beaaeb97abeab1a6e9a18deb90b0ecbb99
UHC 俑앹늿萸썸뤃矣묒춳醴븐슦油ゅ춢循뗫걦額됰컙 111010011011010110011101111011001000100010001000111010111010110110111101111001101000111110110100111010111111100010010001111011001010110110001111111001111110010010111010111011001001101010110000111010101111101010101010111001011010110110000011111000101110000010001011111010111000000110001111111001001111111010001001111010111011000010000100 e9b59dec8888ebadbde68fb4ebf891ecad8fe7e4baec9ab0eafaaae5ad83e2e08beb818fe4fe89ebb084

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
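
A conversion like the one in the table can be sketched in Python as follows. This is only a minimal sketch, not the code behind this page: the names `CODECS`, `encode_row`, and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is how rows full of `3f` bytes can arise.

```python
# Hypothetical sketch: encode a string in several character sets and
# show the resulting bytes as a binary string and a hexadecimal string.

# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("油ゅ").items():
        print(label, bits, hex_string)
```

The sample input "油ゅ" is taken from the Character column above; under these assumptions its UTF-8, SJIS-WIN, and EUC-JP byte sequences appear as substrings of the corresponding hexadecimal columns in the table.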