What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 髴托スケ陟厄スヲ}v髴托スケ陟厄スヲ}vB 1110100110011100100100011110111110111101101110011110100010100000100101101110111110111101101001100111110101110110111010011001110010010001111011111011110110111001111010001010000010010110111011111011110110100110011111010111011001000010 e99c91efbdb9e8a096efbda67d76e99c91efbdb9e8a096efbda67d7642
EUC-JP 髴托スケ陟厄スヲ}v髴托スケ陟厄スヲ}vB 11110001111111001100001011110001100011101011110110001110101110011111000010100010110011001111000110001110101111011000111010100110011111010111011011110001111111001100001011110001100011101011110110001110101110011111000010100010110011001111000110001110101111011000111010100110011111010111011001000010 f1fcc2f18ebd8eb9f0a2ccf18ebd8ea67d76f1fcc2f18ebd8eb9f0a2ccf18ebd8ea67d7642
UTF-8 髴托スケ陟厄スヲ}v髴托スケ陟厄スヲ}vB 1110100110101011101101001110011010001001100110001110111110111101101111011110111110111101101110011110100110011001100111111110010110001110100001001110111110111101101111011110111110111101101001100111110101110110111010011010101110110100111001101000100110011000111011111011110110111101111011111011110110111001111010011001100110011111111001011000111010000100111011111011110110111101111011111011110110100110011111010111011001000010 e9abb4e68998efbdbdefbdb9e9999fe58e84efbdbdefbda67d76e9abb4e68998efbdbdefbdb9e9999fe58e84efbdbdefbda67d7642
UHC ?托??陟厄??}v?托??陟厄??}vB 001111111111011011110101001111110011111111110100101100111110010011111000001111110011111101111101011101100011111111110110111101010011111100111111111101001011001111100100111110000011111100111111011111010111011001000010 3ff6f53f3ff4b3e4f83f3f7d763ff6f53f3ff4b3e4f83f3f7d7642

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
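
The rows of the table above can be reproduced with a short script. What follows is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the charsets listed, and that characters a charset cannot represent are replaced with `?` (0x3f), as the `3f` bytes in the table suggest. The names `CHARSETS` and `encode_row` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec."""
    # Unencodable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ}vB"  # illustrative input, not taken from the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_row(sample, codec)
        print(label, sample, binary, hexadecimal)
```

The ASCII characters `}`, `v`, and `B` encode to `7d`, `76`, and `42` in every charset listed, which is why each row of the table ends in `7d7642`; only the non-ASCII characters differ between charsets.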