To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????r[?????????r[^ 0011111100111111001111110011111100111111001111110011111100111111001111110111001001011011001111110011111100111111001111110011111100111111001111110011111100111111011100100101101101011110 3f3f3f3f3f3f3f3f3f725b3f3f3f3f3f3f3f3f3f725b5e
SJIS-WIN 潁?????擁??r[潁?????擁??r[^ 100111111111000100111111001111110011111100111111001111111001011101101001001111110011111101110010010110111001111111110001001111110011111100111111001111110011111110010111011010010011111100111111011100100101101101011110 9ff13f3f3f3f3f97693f3f725b9ff13f3f3f3f3f97693f3f725b5e
EUC-JP 潁?????擁??r[潁?????擁??r[^ 110111101111001100111111001111110011111100111111001111111100110111001010001111110011111101110010010110111101111011110011001111110011111100111111001111110011111111001101110010100011111100111111011100100101101101011110 def33f3f3f3f3fcdca3f3f725bdef33f3f3f3f3fcdca3f3f725b5e
UTF-8 潁얜젪歷쎈젣擁노젍r[潁얜젪歷쎈젣擁노젍r[^ 1110011010111101100000011110110010010110100111001110110010100000101010101110111110100110100011001110110010001110100010001110110010100000101000111110011010010011100000011110101110000101101110001110110010100000100011010111001001011011111001101011110110000001111011001001011010011100111011001010000010101010111011111010011010001100111011001000111010001000111011001010000010100011111001101001001110000001111010111000010110111000111011001010000010001101011100100101101101011110 e6bd81ec969ceca0aaefa68cec8e88eca0a3e69381eb85b8eca08d725be6bd81ec969ceca0aaefa68cec8e88eca0a3e69381eb85b8eca08d725b5e
UHC 潁얜젪歷쎈젣擁노젍r[潁얜젪歷쎈젣擁노젍r[^ 1110011110111000101111101110101110100000101000101110011010111000101111011110101110100000100111001110100010110110101100111110101110100000100011100111001001011011111001111011100010111110111010111010000010100010111001101011100010111101111010111010000010011100111010001011011010110011111010111010000010001110011100100101101101011110 e7b8beeba0a2e6b8bdeba09ce8b6b3eba08e725be7b8beeba0a2e6b8bdeba09ce8b6b3eba08e725b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)