To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鳶??違??遺??耶??鳶??違??遺??耶??^ 100100111100111000111111001111111000100011100001001111110011111110001000111000100011111100111111100101101110101100111111001111111001001111001110001111110011111110001000111000010011111100111111100010001110001000111111001111111001011011101011001111110011111101011110 93ce3f3f88e13f3f88e23f3f96eb3f3f93ce3f3f88e13f3f88e23f3f96eb3f3f5e
EUC-JP 鳶??違??遺??耶??鳶??違??遺??耶??^ 110001101101000000111111001111111011000011100011001111110011111110110000111001000011111100111111110011001110110100111111001111111100011011010000001111110011111110110000111000110011111100111111101100001110010000111111001111111100110011101101001111110011111101011110 c6d03f3fb0e33f3fb0e43f3fcced3f3fc6d03f3fb0e33f3fb0e43f3fcced3f3f5e
UTF-8 鳶롫끏違욄룚遺살쭟耶븐쵌鳶롫끏違욄룚遺살쭟耶븐쵌^ 11101001101100111011011011101011101000011010101111101011100000011000111111101001100000011001010111101100100110101000010011101011101000111001101011101001100000011011101011101100100000101011010011101100101011011001111111101000100000001011011011101011101110001001000011101100101101011000110011101001101100111011011011101011101000011010101111101011100000011000111111101001100000011001010111101100100110101000010011101011101000111001101011101001100000011011101011101100100000101011010011101100101011011001111111101000100000001011011011101011101110001001000011101100101101011000110001011110 e9b3b6eba1abeb818fe98195ec9a84eba39ae981baec82b4ecad9fe880b6ebb890ecb58ce9b3b6eba1abeb818fe98195ec9a84eba39ae981baec82b4ecad9fe880b6ebb890ecb58c5e
UHC 鳶롫끏違욄룚遺살쭟耶븐쵌鳶롫끏違욄룚遺살쭟耶븐쵌^ 11100110111010011000111011101011100001011011111111101010110111101001111011100110100011111001011011101011101101101011101111101100101001111001010011100101101011011011101011101100101011001000111011100110111010011000111011101011100001011011111111101010110111101001111011100110100011111001011011101011101101101011101111101100101001111001010011100101101011011011101011101100101011001000111001011110 e6e98eeb85bfeade9ee68f96ebb6bbeca794e5adbaecac8ee6e98eeb85bfeade9ee68f96ebb6bbeca794e5adbaecac8e5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
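
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that SJIS-WIN corresponds to Python's `cp932` codec and UHC to `cp949`, and that characters a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the rows above. The names `CHARSETS` and `encode_table` are made up for this illustration.

```python
# Map the charset labels used in the table to Python codec names.
# The cp932 and cp949 mappings are assumptions about what this page uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ^"):
        print(label, bits, hexstr)
```

Note that this sketch encodes the input string directly. It does not reproduce the "Character" column of the table, which shows how the bytes look when read back in another charset.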