To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???援?ぜ矣??猥??夷??喩??沃?? 00111111001111110011111110001001100001110011111110000010101110101110000111100001001111110011111111100000110011100011111100111111100010001100111000111111001111111001101001100111001111110011111110010111100000000011111100111111 3f3f3f89873f82bae1e13f3fe0ce3f3f88ce3f3f9a673f3f97803f3f
EUC-JP ???援?ぜ矣??猥??夷??喩??沃?? 00111111001111110011111110110001111001110011111110100100101111001110001011100011001111110011111111100000110100000011111100111111101100001101000000111111001111111101001111001000001111110011111111001101111000000011111100111111 3f3f3fb1e73fa4bce2e33f3fe0d03f3fb0d03f3fd3c83f3fcde03f3f
UTF-8 歷띰퐢援앲ぜ矣뺢퍢猥됰뗀夷꿨춢喩뽯굜沃쇰푶 111011111010011010001100111010111001110110110000111011011001000010100010111001101000111110110100111011001001010110110010111000111000000110011100111001111001111110100011111010111011101010100010111011011000110110100010111001111000110010100101111010111001000010110000111010111001011110000000111001011010010010110111111010101011111110101000111011001011011010100010111001011001011010101001111010111011110110101111111010101011010110011100111001101011001010000011111011001000011110110000111011011001000110110110 efa68ceb9db0ed90a2e68fb4ec95b2e3819ce79fa3ebbaa2ed8da2e78ca5eb90b0eb9780e5a4b7eabfa8ecb6a2e596a9ebbdafeab59ce6b283ec87b0ed91b6
UHC 歷띰퐢援앲ぜ矣뺢퍢猥됰뗀夷꿨춢喩뽯굜沃쇰푶 111001101011100010110110111011111011110110001011111010101011010110011101111010001010101010111100111010111111100010010101111010101011101110011001111010001110010110001001111010111011011010111110111011001010100010110010111001011010110110000011111010101110011110010110111010111000001010000100111010001010101010111100111010111011111010000100 e6b8b6efbd8beab59de8aabcebf895eabb99e8e589ebb6beeca8b2e5ad83eae796eb8284e8aabcebbe84

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)