To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 偲ヲト而偲、ドナ、ト鹿偲ヲト磁偲、ト竺 10001110110000111010011011000100100011101010011110001110110000111010010011000100110111101100010110100100110001001000111010101101100011101100001110100110110001001000111010100101100011101100001110100100110001001000111010110001 8ec3a6c48ea78ec3a4c4dec5a4c48ead8ec3a6c48ea58ec3a4c48eb1
EUC-JP 偲ヲト而偲、ドナ、ト鹿偲ヲト磁偲、ト竺 10111100110001011000111010100110100011101100010010111100101010011011110011000101100011101010010010001110110001001000111011011110100011101100010110001110101001001000111011000100101111001010111110111100110001011000111010100110100011101100010010111100101001111011110011000101100011101010010010001110110001001011110010110011 bcc58ea68ec4bca9bcc58ea48ec48ede8ec58ea48ec4bcafbcc58ea68ec4bca7bcc58ea48ec4bcb3
UTF-8 偲ヲト而偲、ドナ、ト鹿偲ヲト磁偲、ト竺 111001011000000110110010111011111011110110100110111011111011111010000100111010001000000010001100111001011000000110110010111011111011110110100100111011111011111010000100111011111011111010011110111011111011111010000101111011111011110110100100111011111011111010000100111010011011100110111111111001011000000110110010111011111011110110100110111011111011111010000100111001111010001110000001111001011000000110110010111011111011110110100100111011111011111010000100111001111010101110111010 e581b2efbda6efbe84e8808ce581b2efbda4efbe84efbe9eefbe85efbda4efbe84e9b9bfe581b2efbda6efbe84e7a381e581b2efbda4efbe84e7abba
UHC ???而???????鹿???磁???竺 001111110011111100111111111011001011101100111111001111110011111100111111001111110011111100111111110101101110001100111111001111110011111111101101101110000011111100111111001111111111010111100111 3f3f3fecbb3f3f3f3f3f3f3fd6e33f3f3fedb83f3f3ff5e7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)