To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 齧??癬齧???N}齧??癬齧???N{^ 111010101001011000111111001111111110000110011101111010101001011000111111001111110011111101001110011111011110101010010110001111110011111111100001100111011110101010010110001111110011111100111111010011100111101101011110 ea963f3fe19dea963f3f3f4e7dea963f3fe19dea963f3f3f4e7b5e
EUC-JP 齧??癬齧??璇N}齧??癬齧??璇N{^ 11110011111101100011111100111111111000011111110111110011111101100011111100111111100011111100110011001111010011100111110111110011111101100011111100111111111000011111110111110011111101100011111100111111100011111100110011001111010011100111101101011110 f3f63f3fe1fdf3f63f3f8fcccf4e7df3f63f3fe1fdf3f63f3f8fcccf4e7b5e
UTF-8 齧섈쓱癬齧섈쓱璇N}齧섈쓱癬齧섈쓱璇N{^ 1110100110111101101001111110110010000100100010001110110010010011101100011110011110011001101011001110100110111101101001111110110010000100100010001110110010010011101100011110011110010010100001110100111001111101111010011011110110100111111011001000010010001000111011001001001110110001111001111001100110101100111010011011110110100111111011001000010010001000111011001001001110110001111001111001001010000111010011100111101101011110 e9bda7ec8488ec93b1e799ace9bda7ec8488ec93b1e792874e7de9bda7ec8488ec93b1e799ace9bda7ec8488ec93b1e792874e7b5e
UHC 齧섈쓱癬齧섈쓱璇N}齧섈쓱癬齧섈쓱璇N{^ 11100000111001011011110010101010101111101011001111100000110010001110000011100101101111001010101010111110101100111110000011000110010011100111110111100000111001011011110010101010101111101011001111100000110010001110000011100101101111001010101010111110101100111110000011000110010011100111101101011110 e0e5bcaabeb3e0c8e0e5bcaabeb3e0c64e7de0e5bcaabeb3e0c8e0e5bcaabeb3e0c64e7b5e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
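
A table like the one above can be reproduced offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the codec names (`cp932`, `euc_jp`, `cp949`) are Python's own aliases, mapping UHC to `cp949` is an assumption about what this page means by UHC, and the helper name `encode_row` is hypothetical. Unencodable characters are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 row.

```python
# Map the charset labels used in the table to Python codec names.
# The labels come from the page; the codec names are Python's (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return bits, data.hex()


if __name__ == "__main__":
    sample = "N{^"  # the ASCII tail of the input shown in the table
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

For pure ASCII input such as `N{^`, all five charsets yield the same bytes (`4e7b5e`); the rows diverge only for characters outside ASCII.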