To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN 雲?雲?[雲?雲?[^ 100010010101111100111111100010010101111100111111010110111000100101011111001111111000100101011111001111110101101101011110 895f3f895f3f5b895f3f895f3f5b5e
EUC-JP 雲?雲?[雲?雲?[^ 101100011100000000111111101100011100000000111111010110111011000111000000001111111011000111000000001111110101101101011110 b1c03fb1c03f5bb1c03fb1c03f5b5e
UTF-8 雲렱雲렱[雲렱雲렱[^ 111010011001101110110010111010111010000010110001111010011001101110110010111010111010000010110001010110111110100110011011101100101110101110100000101100011110100110011011101100101110101110100000101100010101101101011110 e99bb2eba0b1e99bb2eba0b15be99bb2eba0b1e99bb2eba0b15b5e
UHC 雲렱雲렱[雲렱雲렱[^ 11101010101000111000111010111110111010101010001110001110101111100101101111101010101000111000111010111110111010101010001110001110101111100101101101011110 eaa38ebeeaa38ebe5beaa38ebeeaa38ebe5b5e
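The rows above can be approximated with a short script. This is only a sketch, not the tool's actual implementation: the Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumed to correspond to the charset labels in the table, the helper name encode_row is hypothetical, and errors="replace" is an assumption chosen to mimic the "?" (0x3f) that appears where a character has no mapping.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (round-tripped text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    # Each byte becomes 8 binary digits, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    # Decoding the bytes again shows what survives in that charset.
    return raw.decode(codec), bits, raw.hex()


if __name__ == "__main__":
    sample = "雲렱"
    for label, codec in CODECS.items():
        shown, bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.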

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).