What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 竣?d遇?臍經??竣?d遇?臍經??^ 1000111101110110001111111000001010000100100010111111011000111111111001000110000011100011010100110011111100111111100011110111011000111111100000101000010010001011111101100011111111100100011000001110001101010011001111110011111101011110 8f763f82848bf63fe460e3533f3f8f763f82848bf63fe460e3533f3f5e
EUC-JP 竣?d遇?臍經??竣?d遇?臍經??^ 1011110111010111001111111010001111100100101101101111100000111111111001111100000111100101101101000011111100111111101111011101011100111111101000111110010010110110111110000011111111100111110000011110010110110100001111110011111101011110 bdd73fa3e4b6f83fe7c1e5b43f3fbdd73fa3e4b6f83fe7c1e5b43f3f5e
UTF-8 竣얍d遇렟臍經렞렢竣얍d遇렟臍經렞렗^ 11100111101010111010001111101100100101101000110111101111101111011000010011101001100000011000011111101011101000001001111111101000100001111000110111100111101101101001001111101011101000001001111011101011101000001010001011100111101010111010001111101100100101101000110111101111101111011000010011101001100000011000011111101011101000001001111111101000100001111000110111100111101101101001001111101011101000001001111011101011101000001001011101011110 e7aba3ec968defbd84e98187eba09fe8878de7b693eba09eeba0a2e7aba3ec968defbd84e98187eba09fe8878de7b693eba09eeba0975e
UHC 竣얍d遇렟臍經렞렢竣얍d遇렟臍經렞렗^ 11110001111000101011111011100101101000111110010011101001111001111000111010110000111100001011000011001100111010001000111010101111100011101011001111110001111000101011111011100101101000111110010011101001111001111000111010110000111100001011000011001100111010001000111010101111100011101010110001011110 f1e2bee5a3e4e9e78eb0f0b0cce88eaf8eb3f1e2bee5a3e4e9e78eb0f0b0cce88eaf8eac5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
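
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The function names `encode_to_bits` and `convert` are hypothetical, and the mapping of the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the ISO-8859-1 row above, though the tool's exact replacement rules may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as code page 949
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' rather than raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("^").items():
        print(label, bits, hexa)
```

Using `errors="replace"` keeps the output aligned across charsets, at the cost of hiding which characters were actually unmappable. Passing `errors="strict"` instead would raise `UnicodeEncodeError` for those characters.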