To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 顆滂ス「鞜撰スソN}顆滂ス「鞜撰スソN{^ 1110100011110111100111111110111110111101101000101110100011011111100100001110111110111101101111110100111001111101111010001111011110011111111011111011110110100010111010001101111110010000111011111011110110111111010011100111101101011110 e8f79fefbda2e8df90efbdbf4e7de8f79fefbda2e8df90efbdbf4e7b5e
EUC-JP 顆滂ス「鞜撰スソN}顆滂ス「鞜撰スソN{^ 11110000111110011101111011110001100011101011110110001110101000101111000011100001110000001111000110001110101111011000111010111111010011100111110111110000111110011101111011110001100011101011110110001110101000101111000011100001110000001111000110001110101111011000111010111111010011100111101101011110 f0f9def18ebd8ea2f0e1c0f18ebd8ebf4e7df0f9def18ebd8ea2f0e1c0f18ebd8ebf4e7b5e
UTF-8 顆滂ス「鞜撰スソN}顆滂ス「鞜撰スソN{^ 1110100110100001100001101110011010111011100000101110111110111101101111011110111110111101101000101110100110011110100111001110011010010010101100001110111110111101101111011110111110111101101111110100111001111101111010011010000110000110111001101011101110000010111011111011110110111101111011111011110110100010111010011001111010011100111001101001001010110000111011111011110110111101111011111011110110111111010011100111101101011110 e9a186e6bb82efbdbdefbda2e99e9ce692b0efbdbdefbdbf4e7de9a186e6bb82efbdbdefbda2e99e9ce692b0efbdbdefbdbf4e7b5e
UHC 顆滂???撰??N}顆滂???撰??N{^ 110011101010100011011011101101010011111100111111001111111111001110111100001111110011111101001110011111011100111010101000110110111011010100111111001111110011111111110011101111000011111100111111010011100111101101011110 cea8dbb53f3f3ff3bc3f3f4e7dcea8dbb53f3f3ff3bc3f3f4e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
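
The conversion listed per charset can be approximated with a minimal Python sketch. This is not the page's own implementation, which is not shown. The mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) and the substitution of "?" (0x3f) for characters a charset cannot represent are assumptions, and the helper names to_bit_strings and convert are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" emits "?" (0x3f) for characters the charset cannot
    # represent; this mirrors the 3f bytes in the table but is an assumption.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("あ").items():
        print(label, binary, hexa)
```

Because each encoded byte is rendered as exactly eight bits, the binary column is always four times as long as the hexadecimal column.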