To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 詣≪???????[詣≪???????[^ 10001100011101111000000111100001001111110011111100111111001111110011111100111111001111110101101110001100011101111000000111100001001111110011111100111111001111110011111100111111001111110101101101011110 8c7781e13f3f3f3f3f3f3f5b8c7781e13f3f3f3f3f3f3f5b5e
EUC-JP 詣≪???????[詣≪???????[^ 10110111110110001010001011100011001111110011111100111111001111110011111100111111001111110101101110110111110110001010001011100011001111110011111100111111001111110011111100111111001111110101101101011110 b7d8a2e33f3f3f3f3f3f3f5bb7d8a2e33f3f3f3f3f3f3f5b5e
UTF-8 詣≪풄溜겼읃溜롫젍[詣≪풄溜겼읃溜롫젍[^ 111010001010100110100011111000101000100110101010111011011001001010000100111011111010011110001011111010101011001010111100111011001001110110000011111011111010011110001011111010111010000110101011111011001010000010001101010110111110100010101001101000111110001010001001101010101110110110010010100001001110111110100111100010111110101010110010101111001110110010011101100000111110111110100111100010111110101110100001101010111110110010100000100011010101101101011110 e8a9a3e289aaed9284efa78beab2bcec9d83efa78beba1abeca08d5be8a9a3e289aaed9284efa78beab2bcec9d83efa78beba1abeca08d5b5e
UHC 詣≪풄溜겼읃溜롫젍[詣≪풄溜겼읃溜롫젍[^ 111001111110000110100001111011001011111010001100111010101111111010110000111001011001111110111010111010101111111010001110111010111010000010001110010110111110011111100001101000011110110010111110100011001110101011111110101100001110010110011111101110101110101011111110100011101110101110100000100011100101101101011110 e7e1a1ecbe8ceafeb0e59fbaeafe8eeba08e5be7e1a1ecbe8ceafeb0e59fbaeafe8eeba08e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)