What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN セ謗ムシ狎骼アシソn}セ謗ムシ狎骼アシソn{^ 1011111011100110100011101101000110111100111000001011111011101001100011101011000110111100101111110110111001111101101111101110011010001110110100011011110011100000101111101110100110001110101100011011110010111111011011100111101101011110 bee68ed1bce0bee98eb1bcbf6e7dbee68ed1bce0bee98eb1bcbf6e7b5e
EUC-JP セ謗ムシ狎骼アシソn}セ謗ムシ狎骼アシソn{^ 1000111010111110111010111110111010001110110100011000111010111100111000001100000011110001111011101000111010110001100011101011110010001110101111110110111001111101100011101011111011101011111011101000111011010001100011101011110011100000110000001111000111101110100011101011000110001110101111001000111010111111011011100111101101011110 8ebeebee8ed18ebce0c0f1ee8eb18ebc8ebf6e7d8ebeebee8ed18ebce0c0f1ee8eb18ebc8ebf6e7b5e
UTF-8 セ謗ムシ狎骼アシソn}セ謗ムシ狎骼アシソn{^ 1110111110111101101111101110100010101100100101111110111110111110100100011110111110111101101111001110011110001011100011101110100110101010101111001110111110111101101100011110111110111101101111001110111110111101101111110110111001111101111011111011110110111110111010001010110010010111111011111011111010010001111011111011110110111100111001111000101110001110111010011010101010111100111011111011110110110001111011111011110110111100111011111011110110111111011011100111101101011110 efbdbee8ac97efbe91efbdbce78b8ee9aabcefbdb1efbdbcefbdbf6e7defbdbee8ac97efbe91efbdbce78b8ee9aabcefbdb1efbdbcefbdbf6e7b5e
UHC ?謗??狎????n}?謗??狎????n{^ 001111111101101110111111001111110011111111100100111001000011111100111111001111110011111101101110011111010011111111011011101111110011111100111111111001001110010000111111001111110011111100111111011011100111101101011110 3fdbbf3f3fe4e43f3f3f3f6e7d3fdbbf3f3fe4e43f3f3f3f6e7b5e

SJIS-win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
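
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Hypothetical sample input: one kanji followed by two ASCII characters.
    for name, (bits, hex_string) in encode_table("謗n}").items():
        print(name, bits, hex_string)
```

ASCII characters such as `n` and `}` produce the same bytes (`6e`, `7d`) in every row, while the kanji takes two bytes in SJIS-win and EUC-JP and three bytes in UTF-8.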