To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????e????????????e????z 0011111100111111001111110011111101100101001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011001010011111100111111001111110011111101111010 3f3f3f3f653f3f3f3f3f3f3f3f3f3f3f3f653f3f3f3f7a
SJIS-WIN 偲ヲト治e偲ヲト示偲、ト竺偲ヲト治e偲ヲト爾z 100011101100001110100110110001001000111010100001011001011000111011000011101001101100010010001110101001101000111011000011101001001100010010001110101100011000111011000011101001101100010010001110101000010110010110001110110000111010011011000100100011101010001001111010 8ec3a6c48ea1658ec3a6c48ea68ec3a4c48eb18ec3a6c48ea1658ec3a6c48ea27a
EUC-JP 偲ヲト治e偲ヲト示偲、ト竺偲ヲト治e偲ヲト爾z 10111100110001011000111010100110100011101100010010111100101000110110010110111100110001011000111010100110100011101100010010111100101010001011110011000101100011101010010010001110110001001011110010110011101111001100010110001110101001101000111011000100101111001010001101100101101111001100010110001110101001101000111011000100101111001010010001111010 bcc58ea68ec4bca365bcc58ea68ec4bca8bcc58ea48ec4bcb3bcc58ea68ec4bca365bcc58ea68ec4bca47a
UTF-8 偲ヲト治e偲ヲト示偲、ト竺偲ヲト治e偲ヲト爾z 111001011000000110110010111011111011110110100110111011111011111010000100111001101011001010111011011001011110010110000001101100101110111110111101101001101110111110111110100001001110011110100100101110101110010110000001101100101110111110111101101001001110111110111110100001001110011110101011101110101110010110000001101100101110111110111101101001101110111110111110100001001110011010110010101110110110010111100101100000011011001011101111101111011010011011101111101111101000010011100111100010001011111001111010 e581b2efbda6efbe84e6b2bb65e581b2efbda6efbe84e7a4bae581b2efbda4efbe84e7abbae581b2efbda6efbe84e6b2bb65e581b2efbda6efbe84e788be7a
UHC ???治e???示???竺???治e???爾z 00111111001111110011111111110110101111010110010100111111001111110011111111100011110001100011111100111111001111111111010111100111001111110011111100111111111101101011110101100101001111110011111100111111111011001011001101111010 3f3f3ff6bd653f3f3fe3c63f3f3ff5e73f3f3ff6bd653f3f3fecb37a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)