What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??N???????^ 0011111100111111010011100011111100111111001111110011111100111111001111110011111101011110 3f3f4e3f3f3f3f3f3f3f5e
SJIS-WIN テ」Nツづ個古ゥツ織^ 110000111010001101001110110000101000001011000011100011001100001010001100110000111010100111000010100100000100010001011110 c3a34ec282c38cc28cc3a9c290445e
EUC-JP テ」Nツづ個古ゥツ織^ 1000111011000011100011101010001101001110100011101100001010100100110001011011100011000100101110001100010110001110101010011000111011000010101111111010010101011110 8ec38ea34e8ec2a4c5b8c4b8c58ea98ec2bfa55e
UTF-8 テ」Nツづ個古ゥツ織^ 1110111110111110100000111110111110111101101000110100111011101111101111101000001011100011100000011010010111100101100000001000101111100101100011111010010011101111101111011010100111101111101111101000001011100111101110011001010001011110 efbe83efbda34eefbe82e381a5e5808be58fa4efbda9efbe82e7b9945e
UHC ??N?づ個古??織^ 001111110011111101001110001111111010101011000101110010111100000111001101101011110011111100111111111100101100010001011110 3f3f4e3faac5cbc1cdaf3f3ff2c45e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
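
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical, and the mapping from the table's labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table.

```python
# Hypothetical mapping from the labels in the table to Python codec names.
# Assumption: Python's cp932 and cp949 are close enough to SJIS-WIN and UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("あ").items():
        print(label, bits, hexstr)
```

Each output line holds the charset label, the binary bit string, and the hexadecimal bit string, in the same column order as the table.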