What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 猥??膺??及??}猥??膺??及??{^ 111000001100111000111111001111111110010001011110001111110011111110001011011110010011111100111111011111011110000011001110001111110011111111100100010111100011111100111111100010110111100100111111001111110111101101011110 e0ce3f3fe45e3f3f8b793f3f7de0ce3f3fe45e3f3f8b793f3f7b5e
EUC-JP 猥??膺??及靷?}猥??膺??及靷?{^ 11100000110100000011111100111111111001111011111100111111001111111011010111011010100011111110011110111101001111110111110111100000110100000011111100111111111001111011111100111111001111111011010111011010100011111110011110111101001111110111101101011110 e0d03f3fe7bf3f3fb5da8fe7bd3f7de0d03f3fe7bf3f3fb5da8fe7bd3f7b5e
UTF-8 猥롮꼳膺곲뇦及靷땙}猥롮꼳膺곲뇦及靷땙{^ 111001111000110010100101111010111010000110101110111010101011110010110011111010001000011010111010111010101011001110110010111010111000011110100110111001011000111110001010111010011001110110110111111010111001010110011001011111011110011110001100101001011110101110100001101011101110101010111100101100111110100010000110101110101110101010110011101100101110101110000111101001101110010110001111100010101110100110011101101101111110101110010101100110010111101101011110 e78ca5eba1aeeabcb3e886baeab3b2eb87a6e58f8ae99db7eb95997de78ca5eba1aeeabcb3e886baeab3b2eb87a6e58f8ae99db7eb95997b5e
UHC 猥롮꼳膺곲뇦及靷땙}猥롮꼳膺곲뇦及靷땙{^ 111010001110010110001110111011001000010010001100111010111110110010000001111010011000011110001110110100001110000011101100111001101000101101101110011111011110100011100101100011101110110010000100100011001110101111101100100000011110100110000111100011101101000011100000111011001110011010001011011011100111101101011110 e8e58eec848cebec81e9878ed0e0ece68b6e7de8e58eec848cebec81e9878ed0e0ece68b6e7b5e

SJIS-Win, EUC-JP: legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
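
A table like the one above can be approximated with a short Python sketch. This is not the converter's actual implementation. The codec names are Python's own, and mapping SJIS-WIN to `cp932` and UHC to `cp949` is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the rows above show.

```python
# Map the charset labels used in the table to Python codec names.
# The SJIS-WIN -> cp932 and UHC -> cp949 pairings are assumptions.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, decoded text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexed in encode_table("及}"):
        print(label, shown, bits, hexed)
```

For the input "及}", the ISO-8859-1 row comes out as "?}" with hex 3f7d, because 及 has no Latin-1 code point, while the Japanese charsets and UTF-8 each produce their own multi-byte sequence followed by 7d.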