To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 偲悉篠治篠シト竺偲竺偲治偲ミナ蒔篠シト叱 10001110110000111000111010111011100011101100001010001110101000011000111011000010101111001100010010001110101100011000111011000011100011101011000110001110110000111000111010100001100011101100001111010000110001011000111010101010100011101100001010111100110001001000111010110110 8ec38ebb8ec28ea18ec2bcc48eb18ec38eb18ec38ea18ec3d0c58eaa8ec2bcc48eb6
EUC-JP 偲悉篠治篠シト竺偲竺偲治偲ミナ蒔篠シト叱 10111100110001011011110010111101101111001100010010111100101000111011110011000100100011101011110010001110110001001011110010110011101111001100010110111100101100111011110011000101101111001010001110111100110001011000111011010000100011101100010110111100101011001011110011000100100011101011110010001110110001001011110010111000 bcc5bcbdbcc4bca3bcc48ebc8ec4bcb3bcc5bcb3bcc5bca3bcc58ed08ec5bcacbcc48ebc8ec4bcb8
UTF-8 偲悉篠治篠シト竺偲竺偲治偲ミナ蒔篠シト叱 111001011000000110110010111001101000001010001001111001111010111110100000111001101011001010111011111001111010111110100000111011111011110110111100111011111011111010000100111001111010101110111010111001011000000110110010111001111010101110111010111001011000000110110010111001101011001010111011111001011000000110110010111011111011111010010000111011111011111010000101111010001001001010010100111001111010111110100000111011111011110110111100111011111011111010000100111001011000111110110001 e581b2e68289e7afa0e6b2bbe7afa0efbdbcefbe84e7abbae581b2e7abbae581b2e6b2bbe581b2efbe90efbe85e89294e7afa0efbdbcefbe84e58fb1
UHC ?悉篠治篠??竺?竺?治???蒔篠??叱 001111111110001111111010111000011100011011110110101111011110000111000110001111110011111111110101111001110011111111110101111001110011111111110110101111010011111100111111001111111110001111001000111000011100011000111111001111111111001011101010 3fe3fae1c6f6bde1c63f3ff5e73ff5e73ff6bd3f3f3fe3c8e1c63f3ff2ea

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)