To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?庶????????^ 001111111000111110001110001111110011111100111111001111110011111100111111001111110011111101011110 3f8f8e3f3f3f3f3f3f3f3f5e
EUC-JP ?庶????????^ 001111111011110111101110001111110011111100111111001111110011111100111111001111110011111101011110 3fbdee3f3f3f3f3f3f3f3f5e
UTF-8 렱庶렎송솬앉렖샷송솩^ 11101011101000001011000111100101101110101011011011101011101000001000111011101100100001101010000111101100100001101010110011101100100101011000100111101011101000001001011011101100100000111011011111101100100001101010000111101100100001101010100101011110 eba0b1e5bab6eba08eec86a1ec86acec9589eba096ec83b7ec86a1ec86a95e
UHC 렱庶렎송솬앉렖샷송솩^ 100011101011111011011111111011101000111010100100101111001101101110111100110111111011111011001001100011101010101110111100101001101011110011011011101111001101111001011110 8ebedfee8ea4bcdbbcdfbec98eabbca6bcdbbcde5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)