To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 俉??玉?????語↑?節?С???要??^ 1111101001100001001111110011111110001011110010100011111100111111001111110011111100111111100011001110101010000001101010100011111110010000110111110011111110000100010100100011111100111111001111111001011101110110001111110011111101011110 fa613f3f8bca3f3f3f3f3f8cea81aa3f90df3f84523f3f3f97763f3f5e
EUC-JP 俉??玉??孼??語↑?節?С???要??^ 1000111110110001101110110011111100111111101101101100110000111111001111111000111110111010110000110011111100111111101110001110110010100010101011000011111111000000111000010011111110100111101100110011111100111111001111111100110111010111001111110011111101011110 8fb1bb3f3fb6cc3f3f8fbac33f3fb8eca2ac3fc0e13fa7b33f3f3fcdd73f3f5e
UTF-8 俉놂슝玉녔쫲孼닻찕語↑뜵節배С礖딀뤃要뺞궠^ 111001001011111110001001111010111000011010000010111011001000101010011101111001111000111010001001111010111000010110010100111011001010101110110010111001011010110110111100111010111000101110111011111011001011000010010101111010001010101010011110111000101000011010010001111010111001110010110101111001111010111110000000111010111011000010110000110100001010000111100111101001001001011011101011100101001000000011101011101001001000001111101000101001101000000111101011101110101001111011101010101101101010000001011110 e4bf89eb8682ec8a9de78e89eb8594ecabb2e5adbceb8bbbecb095e8aa9ee28691eb9cb5e7af80ebb0b0d0a1e7a496eb9480eba483e8a681ebba9eeab6a05e
UHC 俉놂슝玉녔쫲孼닻찕語↑뜵節배С礖딀뤃要뺞궠^ 11100111111010111011001111101111101111011011100111101000101011001011001111100110101001101000101011100101111011011011010011101001101010011001010111100101110111101010000111101000100011011011001111101111101111011011100111101000101011001011001111100110101001101000101011100110100011111011010011101001101010011001010111100110100000101011001101011110 e7ebb3efbdb9e8acb3e6a68ae5edb4e9a995e5dea1e88db3efbdb9e8acb3e6a68ae68fb4e9a995e682b35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)