To what bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????D^SB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000100010111100101001101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f445e5342
SJIS-WIN テδ、テつオテつクテδッテェテクテ」ツュD^SB 11000011100000111100001010100100110000111000001011000010101101011100001110000010110000101011100011000011100000111100001010101111110000111010101011000011101110001100001110100011110000101010110101000100010111100101001101000010 c383c2a4c382c2b5c382c2b8c383c2afc3aac3b8c3a3c2ad445e5342
EUC-JP テδ、テつオテつクテδッテェテクテ」ツュD^SB 1000111011000011101001101100010010001110101001001000111011000011101001001100010010001110101101011000111011000011101001001100010010001110101110001000111011000011101001101100010010001110101011111000111011000011100011101010101010001110110000111000111010111000100011101100001110001110101000111000111011000010100011101010110101000100010111100101001101000010 8ec3a6c48ea48ec3a4c48eb58ec3a4c48eb88ec3a6c48eaf8ec38eaa8ec38eb88ec38ea38ec28ead445e5342
UTF-8 テδ、テつオテつクテδッテェテクテ」ツュD^SB 1110111110111110100000111100111010110100111011111011110110100100111011111011111010000011111000111000000110100100111011111011110110110101111011111011111010000011111000111000000110100100111011111011110110111000111011111011111010000011110011101011010011101111101111011010111111101111101111101000001111101111101111011010101011101111101111101000001111101111101111011011100011101111101111101000001111101111101111011010001111101111101111101000001011101111101111011010110101000100010111100101001101000010 efbe83ceb4efbda4efbe83e381a4efbdb5efbe83e381a4efbdb8efbe83ceb4efbdafefbe83efbdaaefbe83efbdb8efbe83efbda3efbe82efbdad445e5342
UHC ?δ??つ??つ??δ?????????D^SB 00111111101001011110010000111111001111111010101011000100001111110011111110101010110001000011111100111111101001011110010000111111001111110011111100111111001111110011111100111111001111110011111101000100010111100101001101000010 3fa5e43f3faac43f3faac43f3fa5e43f3f3f3f3f3f3f3f3f445e5342

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
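
The conversion shown in the table can be approximated with a few lines of Python. This is a minimal sketch, not the converter's actual implementation. The function name encode_in_charsets and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions, and Python's codecs may differ from the tool's in a few edge-case characters. Characters that a charset cannot represent are replaced with "?" (hex 3f), which mirrors the runs of 3f in the table.

```python
# Illustrative sketch only; the label-to-codec mapping below is an assumption.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_str) in encode_in_charsets("D^SB").items():
        print(label, bits, hex_str)
```

For plain ASCII input such as "D^SB", every charset listed here produces the same bytes (445e5342), which is why all rows of the table end with the same suffix.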