To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????Þ???????????Þ???^ 00111111001111110011111100111111001111110011111100111111001111111101111000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101111000111111001111110011111101011110 3f3f3f3f3f3f3f3fde3f3f3f3f3f3f3f3f3f3f3fde3f3f3f5e
SJIS-WIN 孃る??????????孃る??????????^ 1001101101101111100000101110100100111111001111110011111100111111001111110011111100111111001111110011111100111111100110110110111110000010111010010011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 9b6f82e93f3f3f3f3f3f3f3f3f3f9b6f82e93f3f3f3f3f3f3f3f3f3f5e
EUC-JP 孃る??????Þ縕??孃る??????Þ縕??^ 11010101110100001010010011101011001111110011111100111111001111110011111100111111100011111010100110110000100011111101010011000010001111110011111111010101110100001010010011101011001111110011111100111111001111110011111100111111100011111010100110110000100011111101010011000010001111110011111101011110 d5d0a4eb3f3f3f3f3f3f8fa9b08fd4c23f3fd5d0a4eb3f3f3f3f3f3f8fa9b08fd4c23f3f5e
UTF-8 孃る죲溜띾ㅎ蓮붻Þ縕루껍孃る죲溜띾ㅎ蓮붻Þ縕룡쥈^ 1110010110101101100000111110001110000010100010111110110010100011101100101110111110100111100010111110101110011101101111101110001110000101100011101110111110100110100110011110101110110110101110111100001110011110111001111011100010010101111010111010001110101000111010101011101110001101111001011010110110000011111000111000001010001011111011001010001110110010111011111010011110001011111010111001110110111110111000111000010110001110111011111010011010011001111010111011011010111011110000111001111011100111101110001001010111101011101000111010000111101100101001011000100001011110 e5ad83e3828beca3b2efa78beb9dbee3858eefa699ebb6bbc39ee7b895eba3a8eabb8de5ad83e3828beca3b2efa78beb9dbee3858eefa699ebb6bbc39ee7b895eba3a1eca5885e
UHC 孃る죲溜띾ㅎ蓮붻Þ縕루껍孃る죲溜띾ㅎ蓮붻Þ縕룡쥈^ 11100101101111101010101011101011101000011000110111101010111111101000110111101011101001001011111011100110111001011001010011101000101010001010110111101000101100101011011111100111101100101010111011100101101111101010101011101011101000011000110111101010111111101000110111101011101001001011111011100110111001011001010011101000101010001010110111101000101100101011011111100110101000101000000101011110 e5beaaeba18deafe8deba4bee6e594e8a8ade8b2b7e7b2aee5beaaeba18deafe8deba4bee6e594e8a8ade8b2b7e6a2815e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)