To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ツ陳」ツ猟慊陳」 110000101001001011000010101000111100001010010111110000101001110011000010100100101100001010100011 c292c2a3c297c29cc292c2a3
EUC-JP ツ陳」ツ猟慊陳」 10001110110000101100010011000100100011101010001110001110110000101100111011000100110110001100010011000100110001001000111010100011 8ec2c4c48ea38ec2cec4d8c4c4c48ea3
UTF-8 ツ陳」ツ猟慊陳」 111011111011111010000010111010011001100110110011111011111011110110100011111011111011111010000010111001111000110010011111111001101000010110001010111010011001100110110011111011111011110110100011 efbe82e999b3efbda3efbe82e78c9fe6858ae999b3efbda3
UHC ?陳???慊陳? 0011111111110010111001110011111100111111001111111100110011000011111100101110011100111111 3ff2e73f3f3fccc3f2e73f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)