To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??U???????U?????\ 0011111100111111010101010011111100111111001111110011111100111111001111110011111101010101001111110011111100111111001111110011111101011100 3f3f553f3f3f3f3f3f3f553f3f3f3f3f5c
SJIS-WIN テ」Uツ甘ヲツ陰テ」Uツ古ュツ秀\ 110000111010001101010101110000101000101011000011101001101100001010001001010000011100001110100011010101011100001010001100110000111010110111000010100011110100011101011100 c3a355c28ac3a6c28941c3a355c28cc3adc28f475c
EUC-JP テ」Uツ甘ヲツ陰テ」Uツ古ュツ秀\ 10001110110000111000111010100011010101011000111011000010101101001100010110001110101001101000111011000010101100011010001010001110110000111000111010100011010101011000111011000010101110001100010110001110101011011000111011000010101111011010100001011100 8ec38ea3558ec2b4c58ea68ec2b1a28ec38ea3558ec2b8c58ead8ec2bda85c
UTF-8 テ」Uツ甘ヲツ陰テ」Uツ古ュツ秀\ 111011111011111010000011111011111011110110100011010101011110111110111110100000101110011110010100100110001110111110111101101001101110111110111110100000101110100110011001101100001110111110111110100000111110111110111101101000110101010111101111101111101000001011100101100011111010010011101111101111011010110111101111101111101000001011100111101001111000000001011100 efbe83efbda355efbe82e79498efbda6efbe82e999b0efbe83efbda355efbe82e58fa4efbdadefbe82e7a7805c
UHC ??U?甘??陰??U?古??秀\ 001111110011111101010101001111111100101011110110001111110011111111101011111001000011111100111111010101010011111111001101101011110011111100111111111000101011001101011100 3f3f553fcaf63f3febe43f3f553fcdaf3f3fe2b35c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)