What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 昱巳???????n}昱巳???????n{^ 111110100110001110010110101001000011111100111111001111110011111100111111001111110011111101101110011111011111101001100011100101101010010000111111001111110011111100111111001111110011111100111111011011100111101101011110 fa6396a43f3f3f3f3f3f3f6e7dfa6396a43f3f3f3f3f3f3f6e7b5e
EUC-JP 昱巳???????n}昱巳???????n{^ 1000111111000010101011011100110010100110001111110011111100111111001111110011111100111111001111110110111001111101100011111100001010101101110011001010011000111111001111110011111100111111001111110011111100111111011011100111101101011110 8fc2adcca63f3f3f3f3f3f3f6e7d8fc2adcca63f3f3f3f3f3f3f6e7b5e
UTF-8 昱巳댁렰쇘웩롋댄㉢n}昱巳댁렰쇘웩롋댄㉢n{^ 1110011010011000101100011110010110110111101100111110101110001100100000011110101110100000101100001110110010000111100110001110110010011011101010011110101110100001100010111110101110001100100001001110001110001001101000100110111001111101111001101001100010110001111001011011011110110011111010111000110010000001111010111010000010110000111011001000011110011000111011001001101110101001111010111010000110001011111010111000110010000100111000111000100110100010011011100111101101011110 e698b1e5b7b3eb8c81eba0b0ec8798ec9ba9eba18beb8c84e389a26e7de698b1e5b7b3eb8c81eba0b0ec8798ec9ba9eba18beb8c84e389a26e7b5e
UHC 昱巳댁렰쇘웩롋댄㉢n}昱巳댁렰쇘웩롋댄㉢n{^ 1110100111110000110111101101001110110100111011001000111010111101101111001110011111000000101000011000111011010001101101001110110110101000101100110110111001111101111010011111000011011110110100111011010011101100100011101011110110111100111001111100000010100001100011101101000110110100111011011010100010110011011011100111101101011110 e9f0ded3b4ec8ebdbce7c0a18ed1b4eda8b36e7de9f0ded3b4ec8ebdbce7c0a18ed1b4eda8b36e7b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
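
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" runs visible in the table, although individual byte values may differ from the tool's output for some characters.

```python
# Table label -> Python codec name. These pairings are assumptions made for
# illustration; the tool behind the table may use different conversion tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("n{^").items():
        print(label, bits, hexa)
```

Because "n{^" consists only of ASCII characters, every charset in the sketch yields the same bytes for it, which is why all rows of the table end in `6e7b5e`.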