To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8柚??儒??寤μ?孺???ル?檍??意? 111000011001111100111111100000100101011110010111010011010011111100111111100011101111001000111111001111111001101110001000100000111100101000111111100110110111110100111111001111110011111110000011100010110011111110011110111110000011111100111111100010001101001100111111 e19f3f8257974d3f3f8ef23f3f9b8883ca3f9b7d3f3f3f838b3f9ef83f3f88d33f
EUC-JP 癲?8柚??儒??寤μ?孺???ル?檍??意? 111000101010000100111111101000111011100011001101101011100011111100111111101111001111010000111111001111111101010111101000101001101100110000111111110101011101111000111111001111110011111110100101111010110011111111011100111110100011111100111111101100001101010100111111 e2a13fa3b8cdae3f3fbcf43f3fd5e8a6cc3fd5de3f3f3fa5eb3fdcfa3f3fb0d53f
UTF-8 癲쒕8柚얍젆儒쇱뵖寤μ럩孺양쳥戮ル윣檍욍굥意쯊 1110011110011001101100101110110010010010100101011110111110111100100110001110011010011111100110101110110010010110100011011110110010100000100001101110010110000100100100101110110010000111101100011110101110110101100101101110010110101111101001001100111010111100111010111001111110101001111001011010110110111010111011001001011010010001111011001011001110100101111011111010011110010010111000111000001110101011111011001001110010100011111001101010101010001101111011001001101010001101111010101011010110100101111001101000010010001111111011001010111110001010 e799b2ec9295efbc98e69f9aec968deca086e58492ec87b1ebb596e5afa4cebceb9fa9e5adbaec9691ecb3a5efa792e383abec9ca3e6aa8dec9a8deab5a5e6848fecaf8a
UHC 癲쒕8柚얍젆儒쇱뵖寤μ럩孺양쳥戮ル윣檍욍굥意쯊 11101111101001101001110011101011101000111011100011101010111101101011111011100101101000001000100111101010111000111011110011101100100101001001100011100111111101011010010111101100100011101000110011101010111010001011111011100111101010111000101011101011101111011010101111101011100111111010010011100101111001011011111111100011100000101000101111101011111100101010100101000110 efa69ceba3b8eaf6bee5a089eae3bcec9498e7f5a5ec8e8ceae8bee7ab8aebbdabeb9fa4e5e5bfe3828bebf2a946

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)