Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????^z????????^zB 001111110011111100111111001111110011111100111111001111110011111101011110011110100011111100111111001111110011111100111111001111110011111100111111010111100111101001000010 3f3f3f3f3f3f3f3f5e7a3f3f3f3f3f3f3f3f5e7a42
SJIS-WIN 隘搾スヲ鬲假スォ^z隘搾スヲ鬲假スォ^zB 1110100010100101100011011110111110111101101001101110100110101101100110001110111110111101101010110101111001111010111010001010010110001101111011111011110110100110111010011010110110011000111011111011110110101011010111100111101001000010 e8a58defbda6e9ad98efbdab5e7ae8a58defbda6e9ad98efbdab5e7a42
EUC-JP 隘搾スヲ鬲假スォ^z隘搾スヲ鬲假スォ^zB 11110000101001111011101011110001100011101011110110001110101001101111001010101111110100001111000110001110101111011000111010101011010111100111101011110000101001111011101011110001100011101011110110001110101001101111001010101111110100001111000110001110101111011000111010101011010111100111101001000010 f0a7baf18ebd8ea6f2afd0f18ebd8eab5e7af0a7baf18ebd8ea6f2afd0f18ebd8eab5e7a42
UTF-8 隘搾スヲ鬲假スォ^z隘搾スヲ鬲假スォ^zB 1110100110011010100110001110011010010000101111101110111110111101101111011110111110111101101001101110100110101100101100101110010110000001100001111110111110111101101111011110111110111101101010110101111001111010111010011001101010011000111001101001000010111110111011111011110110111101111011111011110110100110111010011010110010110010111001011000000110000111111011111011110110111101111011111011110110101011010111100111101001000010 e99a98e690beefbdbdefbda6e9acb2e58187efbdbdefbdab5e7ae99a98e690beefbdbdefbda6e9acb2e58187efbdbdefbdab5e7a42
UHC 隘搾???假??^z隘搾???假??^zB 111001001111011011110011101101100011111100111111001111111100101010100011001111110011111101011110011110101110010011110110111100111011011000111111001111110011111111001010101000110011111100111111010111100111101001000010 e4f6f3b63f3f3fcaa33f3f5e7ae4f6f3b63f3f3fcaa33f3f5e7a42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
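
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The names `CHARSETS`, `encode_to_bits`, and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption. Python's codecs may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (hex `3f`), similar to the `3f` bytes in the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {name: encode_to_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("^zB").items():
        print(name, binary, hexadecimal)
```

For the ASCII-only input `^zB`, every charset listed yields the same bytes, `5e7a42`, which matches the trailing bytes of each row in the table.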