To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 筌η??ル?伍?????泳?????蟻??^ 11100010101000111000001111000101001111110011111110000011100010110011111110001100110111100011111100111111001111110011111100111111100010010110101000111111001111110011111100111111001111111000101101100001001111110011111101011110 e2a383c53f3f838b3f8cde3f3f3f3f3f896a3f3f3f3f3f8b613f3f5e
EUC-JP 筌η??ル?伍??煐??泳?????蟻??^ 111001001010010110100110110001110011111100111111101001011110101100111111101110001110000000111111001111111000111111001001111110000011111100111111101100011100101100111111001111110011111100111111001111111011010111000010001111110011111101011110 e4a5a6c73f3fa5eb3fb8e03f3f8fc9f83f3fb1cb3f3f3f3f3fb5c23f3f5e
UTF-8 筌η텚溜ル졁伍곸옺煐븝㎘泳볥젷溜꿱똻蟻곁성^ 111001111010110110001100110011101011011111101101100001011001101011101111101001111000101111100011100000111010101111101100101000011000000111100100101111001000110111101010101100111011100011101100100110001011101011100111100001011001000011101011101110001001110111100011100011101001100011100110101100111011001111101011101100111010010111101100101000001011011111101111101001111000101111101010101111111011000111101011100110001011101111101000100111111011101111101010101100111000000111101100100001001011000101011110 e7ad8cceb7ed859aefa78be383abeca181e4bc8deab3b8ec98bae78590ebb89de38e98e6b3b3ebb3a5eca0b7efa78beabfb1eb98bbe89fbbeab381ec84b15e
UHC 筌η텚溜ル졁伍곸옺煐븝㎘泳볥젷溜꿱똻蟻곁성^ 11101111101001111010010111100111101101101001001111101010111111101010101111101011101000001011001011100111111010101000000111101100100111101011000011100111101111001011101011101111101001111010010111100111101101101001001111101011101000001010101111101010111111101011001011101000100011001000000111101011111111001011000011100111101111001011101001011110 efa7a5e7b693eafeabeba0b2e7ea81ec9eb0e7bcbaefa7a5e7b693eba0abeafeb2e88c81ebfcb0e7bcba5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)