To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?S??\}?S??\{^ 00111111010100110011111100111111010111000111110100111111010100110011111100111111010111000111101101011110 3f533f3f5c7d3f533f3f5c7b5e
SJIS-WIN ?S??\}?S??\{^ 00111111010100110011111100111111010111000111110100111111010100110011111100111111010111000111101101011110 3f533f3f5c7d3f533f3f5c7b5e
EUC-JP ?S??\}?S??\{^ 00111111010100110011111100111111010111000111110100111111010100110011111100111111010111000111101101011110 3f533f3f5c7d3f533f3f5c7b5e
UTF-8 청S횇횧\}청S횇횧\{^ 11101100101100101010110101010011111011011001101010000111111011011001101010100111010111000111110111101100101100101010110101010011111011011001101010000111111011011001101010100111010111000111101101011110 ecb2ad53ed9a87ed9aa75c7decb2ad53ed9a87ed9aa75c7b5e
UHC 청S횇횧\}청S횇횧\{^ 11000011101110110101001111000011100001011100001110011110010111000111110111000011101110110101001111000011100001011100001110011110010111000111101101011110 c3bb53c385c39e5c7dc3bb53c385c39e5c7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)