To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????\ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN 偲自篠クト悉偲璽篠ヲト辞偲璽篠ヲト湿\ 10001110110000111000111010101001100011101100001010111000110001001000111010111011100011101100001110001110101000111000111011000010101001101100010010001110101010111000111011000011100011101010001110001110110000101010011011000100100011101011110001011100 8ec38ea98ec2b8c48ebb8ec38ea38ec2a6c48eab8ec38ea38ec2a6c48ebc5c
EUC-JP 偲自篠クト悉偲璽篠ヲト辞偲璽篠ヲト湿\ 10111100110001011011110010101011101111001100010010001110101110001000111011000100101111001011110110111100110001011011110010100101101111001100010010001110101001101000111011000100101111001010110110111100110001011011110010100101101111001100010010001110101001101000111011000100101111001011111001011100 bcc5bcabbcc48eb88ec4bcbdbcc5bca5bcc48ea68ec4bcadbcc5bca5bcc48ea68ec4bcbe5c
UTF-8 偲自篠クト悉偲璽篠ヲト辞偲璽篠ヲト湿\ 11100101100000011011001011101000100001111010101011100111101011111010000011101111101111011011100011101111101111101000010011100110100000101000100111100101100000011011001011100111100100101011110111100111101011111010000011101111101111011010011011101111101111101000010011101000101111101001111011100101100000011011001011100111100100101011110111100111101011111010000011101111101111011010011011101111101111101000010011100110101110011011111101011100 e581b2e887aae7afa0efbdb8efbe84e68289e581b2e792bde7afa0efbda6efbe84e8be9ee581b2e792bde7afa0efbda6efbe84e6b9bf5c
UHC ?自篠??悉?璽篠????璽篠???\ 0011111111101101101110111110000111000110001111110011111111100011111110100011111111011111110111101110000111000110001111110011111100111111001111111101111111011110111000011100011000111111001111110011111101011100 3fedbbe1c63f3fe3fa3fdfdee1c63f3f3f3fdfdee1c63f3f3f5c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)