To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nf???n^}Y???nf???n^}bE 00111111001111110011111101101110011001100011111100111111001111110110111001011110011111010101100100111111001111110011111101101110011001100011111100111111001111110110111001011110011111010110001001000101 3f3f3f6e663f3f3f6e5e7d593f3f3f6e663f3f3f6e5e7d6245
SJIS-WIN チシ猯nfチシ猯n^}Yチシ猯nfチシ猯n^}bE 1100000110111100111000001100110001101110011001101100000110111100111000001100110001101110010111100111110101011001110000011011110011100000110011000110111001100110110000011011110011100000110011000110111001011110011111010110001001000101 c1bce0cc6e66c1bce0cc6e5e7d59c1bce0cc6e66c1bce0cc6e5e7d6245
EUC-JP チシ猯nfチシ猯n^}Yチシ猯nfチシ猯n^}bE 10001110110000011000111010111100111000001100111001101110011001101000111011000001100011101011110011100000110011100110111001011110011111010101100110001110110000011000111010111100111000001100111001101110011001101000111011000001100011101011110011100000110011100110111001011110011111010110001001000101 8ec18ebce0ce6e668ec18ebce0ce6e5e7d598ec18ebce0ce6e668ec18ebce0ce6e5e7d6245
UTF-8 チシ猯nfチシ猯n^}Yチシ猯nfチシ猯n^}bE 11101111101111101000000111101111101111011011110011100111100011001010111101101110011001101110111110111110100000011110111110111101101111001110011110001100101011110110111001011110011111010101100111101111101111101000000111101111101111011011110011100111100011001010111101101110011001101110111110111110100000011110111110111101101111001110011110001100101011110110111001011110011111010110001001000101 efbe81efbdbce78caf6e66efbe81efbdbce78caf6e5e7d59efbe81efbdbce78caf6e66efbe81efbdbce78caf6e5e7d6245
UHC ???nf???n^}Y???nf???n^}bE 00111111001111110011111101101110011001100011111100111111001111110110111001011110011111010101100100111111001111110011111101101110011001100011111100111111001111110110111001011110011111010110001001000101 3f3f3f6e663f3f3f6e5e7d593f3f3f6e663f3f3f6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)