What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | ツ形ツ坦ツ巽ツ形 | 110000101000110001100000110000101001001001010010110000101001001001000110110000101000110001100000 | c28c60c29252c29246c28c60 |
| EUC-JP | ツ形ツ坦ツ巽ツ形 | 10001110110000101011011111000001100011101100001011000011101100111000111011000010110000111010011110001110110000101011011111000001 | 8ec2b7c18ec2c3b38ec2c3a78ec2b7c1 |
| UTF-8 | ツ形ツ坦ツ巽ツ形 | 111011111011111010000010111001011011110110100010111011111011111010000010111001011001110110100110111011111011111010000010111001011011011110111101111011111011111010000010111001011011110110100010 | efbe82e5bda2efbe82e59da6efbe82e5b7bdefbe82e5bda2 |
| UHC | ?形?坦?巽?形 | 001111111111101110100001001111111111011110100100001111111110000111011110001111111111101110100001 | 3ffba13ff7a43fe1de3ffba1 |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
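
The conversion shown in the results can be approximated in a few lines of Python. This is only a sketch, not the page's actual implementation. The sample string is reconstructed from the UTF-8 hexadecimal column (efbe82 decodes to U+FF82, the half-width katakana). The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters a charset cannot represent are replaced with "?" (3f), which matches the ISO-8859-1 row.

```python
# Sample text reconstructed from the UTF-8 hex column of the results
# (assumption: U+FF82 is the half-width katakana encoded there as ef be 82).
SAMPLE = "\uff82\u5f62\uff82\u5766\uff82\u5dfd\uff82\u5f62"

# Charset label -> Python codec name (this mapping is an assumption of the sketch).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexed) in encode_table(SAMPLE).items():
        print(label, bits, hexed)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.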