To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????Ø?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101100000111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3fd83f3f3f3f3f3f3f3f3f3f
SJIS-WIN 憶?????怨??狎?????怨?????誼 100010011010111100111111001111110011111100111111001111111000100110000101001111110011111111100000101111100011111100111111001111110011111100111111100010011000010100111111001111110011111100111111001111111000101101100010 89af3f3f3f3f3f89853f3fe0be3f3f3f3f3f89853f3f3f3f3f8b62
EUC-JP 憶?????怨??狎?Ø???怨?????誼 1011001010110001001111110011111100111111001111110011111110110001111001010011111100111111111000001100000000111111100011111010100110101100001111110011111100111111101100011110010100111111001111110011111100111111001111111011010111000011 b2b13f3f3f3f3fb1e53f3fe0c03f8fa9ac3f3f3fb1e53f3f3f3f3fb5c3
UTF-8 憶귣봺利억쭓怨뺤젡狎띕Ø利억쭓怨뺤젛濾곌쑬誼 1110011010000110101101101110101010110111101000111110101110110100101110101110111110100111100111011110110010010110101101011110110010101101100100111110011010000000101010001110101110111010101001001110110010100000101000011110011110001011100011101110101110011101100101011100001110011000111011111010011110011101111011001001011010110101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010011011111011111010011010000100111010101011001110001100111011001001000110101100111010001010101010111100 e686b6eab7a3ebb4baefa79dec96b5ecad93e680a8ebbaa4eca0a1e78b8eeb9d95c398efa79dec96b5ecad93e680a8ebbaa4eca09befa684eab38cec91ace8aabc
UHC 憶귣봺利억쭓怨뺤젡狎띕Ø利억쭓怨뺤젛濾곌쑬誼 1110010111100011100000101110101110010100100000011110110010100110101111101110111110100111100010111110101010110011100101011110110010100000100110101110010011100100101101101110101110101000101010101110110010100110101111101110111110100111100010111110101010110011100101011110110010100000100101111110011010100100101100001110101010111110101010001110101111111110 e5e382eb9481eca6beefa78beab395eca09ae4e4b6eba8aaeca6beefa78beab395eca097e6a4b0eabea8ebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)