To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 霄ース疾蒔ク謗クース疾蒔湿^ 111010001011101010110000111100011000111010111101100011101011111011110001111000101000111010101010111100011110010110111000111001101000111010111000101100001111000110001110101111011000111010111110111100011110001010001110101010101111100111111100100011101011110001011110 e8bab0f18ebd8ebef1e28eaaf1e5b8e68eb8b0f18ebd8ebef1e28eaaf9fc8ebc5e
EUC-JP 霄ー?ス疾?蒔?ク謗クー?ス疾?蒔?湿^ 111100001011110010001110101100000011111110001110101111011011110011000000001111111011110010101100001111111000111010111000111010111110111010001110101110001000111010110000001111111000111010111101101111001100000000111111101111001010110000111111101111001011111001011110 f0bc8eb03f8ebdbcc03fbcac3f8eb8ebee8eb88eb03f8ebdbcc03fbcac3fbcbe5e
UTF-8 霄ース疾蒔ク謗クース疾蒔湿^ 11101001100111001000010011101111101111011011000011101110100001001000100111101111101111011011110111100111100101101011111011101110100001011001110111101000100100101001010011101110100001011010000011101111101111011011100011101000101011001001011111101111101111011011100011101111101111011011000011101110100001001000100111101111101111011011110111100111100101101011111011101110100001011001110111101000100100101001010011101110100111011001011111100110101110011011111101011110 e99c84efbdb0ee8489efbdbde796beee859de89294ee85a0efbdb8e8ac97efbdb8efbdb0ee8489efbdbde796beee859de89294ee9d97e6b9bf5e
UHC ????疾?蒔??謗????疾?蒔??^ 00111111001111110011111100111111111100101111000000111111111000111100100000111111001111111101101110111111001111110011111100111111001111111111001011110000001111111110001111001000001111110011111101011110 3f3f3f3ff2f03fe3c83f3fdbbf3f3f3f3ff2f03fe3c83f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)