To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 止?邑基??肄???????邑基??肄???疑?^ 100011100111111000111111100101110101011110001010111011100011111100111111111000111110010100111111001111110011111100111111001111110011111100111111100101110101011110001010111011100011111100111111111000111110010100111111001111110011111110001011010111100011111101011110 8e7e3f97578aee3f3fe3e53f3f3f3f3f3f3f97578aee3f3fe3e53f3f3f8b5e3f5e
EUC-JP 止?邑基??肄???????邑基??肄???疑?^ 101110111101111100111111110011011011100010110100111100000011111100111111111001101110011100111111001111110011111100111111001111110011111100111111110011011011100010110100111100000011111100111111111001101110011100111111001111110011111110110101101111110011111101011110 bbdf3fcdb8b4f03f3fe6e73f3f3f3f3f3f3fcdb8b4f03f3fe6e73f3f3fb5bf3f5e
UTF-8 止렣邑基렰렱肄ㆁ렰렱李뀀렖렣邑基렰렱肄ㆁ렰렱疑렗^ 11100110101011011010001011101011101000001010001111101001100000101001000111100101100111111011101011101011101000001011000011101011101000001011000111101000100000101000010011100011100001101000000111101011101000001011000011101011101000001011000111101111101001111010000111101011100000001000000011101011101000001001011011101011101000001010001111101001100000101001000111100101100111111011101011101011101000001011000011101011101000001011000111101000100000101000010011100011100001101000000111101011101000001011000011101011101000001011000111100111100101101001000111101011101000001001011101011110 e6ada2eba0a3e98291e59fbaeba0b0eba0b1e88284e38681eba0b0eba0b1efa7a1eb8080eba096eba0a3e98291e59fbaeba0b0eba0b1e88284e38681eba0b0eba0b1e79691eba0975e
UHC 止렣邑基렰렱肄ㆁ렰렱李뀀렖렣邑基렰렱肄ㆁ렰렱疑렗^ 11110010101011011000111010110100111010111110100111010000111100011000111010111101100011101011111011101100101111011010010011110001100011101011110110001110101111101110110010110000101100101110101110001110101010111000111010110100111010111110100111010000111100011000111010111101100011101011111011101100101111011010010011110001100011101011110110001110101111101110101111110111100011101010110001011110 f2ad8eb4ebe9d0f18ebd8ebeecbda4f18ebd8ebeecb0b2eb8eab8eb4ebe9d0f18ebd8ebeecbda4f18ebd8ebeebf78eac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)