To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???????維??碎??抑??逾??淫?? 001111110011111100111111001111110011111100111111001111111000100011011011001111110011111111100001111010100011111100111111100101110111110100111111001111111110011110100101001111110011111110001000111110100011111100111111 3f3f3f3f3f3f3f88db3f3fe1ea3f3f977d3f3fe7a53f3f88fa3f3f
EUC-JP ????獒??維??碎??抑??逾??淫?? 0011111100111111001111110011111110001111110010111011101100111111001111111011000011011101001111110011111111100010111011000011111100111111110011011101111000111111001111111110111010100111001111110011111110110000111111000011111100111111 3f3f3f3f8fcbbb3f3fb0dd3f3fe2ec3f3fcdde3f3feea73f3fb0fc3f3f
UTF-8 樂끸뫀흭獒뺣뱽維띹퉪碎븍쳺抑띔랜逾껅끽淫딅꽧 111011111010011010111111111010111000000110111000111010111010101110000000111011011001110110101101111001111000110110010010111010111011101010100011111010111011000110111101111001111011011010101101111010111001110110111001111011011000100110101010111001111010001010001110111010111011100010001101111011001011001110111010111001101000101010010001111010111001110110010100111010111001111010011100111010011000000010111110111010101011101110000101111010111000000110111101111001101011011110101011111010111001010010000101111010101011110110100111 efa6bfeb81b8ebab80ed9dade78d92ebbaa3ebb1bde7b6adeb9db9ed89aae7a28eebb88decb3bae68a91eb9d94eb9e9ce980beeabb85eb81bde6b7abeb9485eabda7
UHC 樂끸뫀흭獒뺣뱽維띹퉪碎븍쳺抑띔랜逾껅끽淫딅꽧 1110100011111001100001011110001010010001101001001100010110001001111010001010001110010101111010111001001110100011111010111010101110001101111010001011100110000010111000011110111110111010111010111010101110011101111001011110010010110110111010101011011110100011111010111011010110000011111001101011001110100011111010111110001010001010111010111000010010110010 e8f985e291a4c589e8a395eb93a3ebab8de8b982e1efbaebab9de5e4b6eab7a3ebb583e6b3a3ebe28aeb84b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)