To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥≪?以?????筌??猷???ル?腋 1001101010001011100000011110000100111111100010001100100000111111001111110011111100111111001111111110001010100011001111110011111110010111010100010011111100111111001111111000001110001011001111111110001111111100 9a8b81e13f88c83f3f3f3f3fe2a33f3f97513f3f3f838b3fe3fc
EUC-JP 嚥≪?以??洧??筌??猷???ル?腋 11010011111010111010001011100011001111111011000011001010001111110011111110001111110001111011010000111111001111111110010010100101001111110011111111001101101100100011111100111111001111111010010111101011001111111110011011111110 d3eba2e33fb0ca3f3f8fc7b43f3fe4a53f3fcdb23f3f3fa5eb3fe6fe
UTF-8 嚥≪늾以곫에洧얠젫筌뚯뼇猷뷴궟戮ル눓腋 111001011001101010100101111000101000100110101010111010111000101010111110111001001011101110100101111010101011001110101011111011001001011110010000111001101011010010100111111011001001011010100000111011001010000010101011111001111010110110001100111010111001101010101111111010111011110010000111111001111000110010110111111010111011011110110100111010101011011010011111111011111010011110010010111000111000001110101011111010111000100010010011111010001000010110001011 e59aa5e289aaeb8abee4bba5eab3abec9790e6b4a7ec96a0eca0abe7ad8ceb9aafebbc87e78cb7ebb7b4eab69fefa792e383abeb8893e8858b
UHC 嚥≪늾以곫에洧얠젫筌뚯뼇猷뷴궟戮ル눓腋 1110011010111111101000011110110010001000100001111110110010100100100000011110011010111111101000011110101011111011101111101110110010100000101000111110111110100111100011001110110010010110100100011110101110100011101110101110010110000010101100101110101110111101101010111110101110000111101011111110010011111101 e6bfa1ec8887eca481e6bfa1eafbbeeca0a3efa78cec9691eba3bae582b2ebbdabeb87afe4fd

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
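
A conversion like the one in the table above can be approximated with a short Python sketch. This is not the code behind the page. The function name `encode_table` and the codec names are assumptions: Python's `cp932` stands in for SJIS-Win and `cp949` for UHC. Characters that a charset cannot represent are replaced with "?" via `errors="replace"`.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, bit string, hex string) for text in each charset.

    The codec names are Python's; cp932 ~ SJIS-Win and cp949 ~ UHC are
    assumed equivalents, not names taken from the original tool.
    """
    rows = []
    for name in charsets:
        # Unencodable characters become "?" (byte 0x3f) instead of raising.
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexa in encode_table("あ"):
        print(name, bits, hexa)
```

Because the replacement character "?" is the single byte 0x3f in all of these charsets, input that a charset cannot represent shows up as runs of 3f, consistent with the ISO-8859-1 row in the table.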