To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???援??蹂l?孃る????蹂?????依 0011111100111111001111111000100110000111001111110011111111100110111110001000001010001100001111111001101101101111100000101110100100111111001111110011111100111111111001101111100000111111001111110011111100111111001111111000100011001011 3f3f3f89873f3fe6f8828c3f9b6f82e93f3f3f3fe6f83f3f3f3f3f88cb
EUC-JP 艅??援??蹂l?孃る????蹂??獒??依 100011111101011011111101001111110011111110110001111001110011111100111111111011001111101010100011111011000011111111010101110100001010010011101011001111110011111100111111001111111110110011111010001111110011111110001111110010111011101100111111001111111011000011001101 8fd6fd3f3fb1e73f3fecfaa3ec3fd5d0a4eb3f3f3f3fecfa3f3f8fcbbb3f3fb0cd
UTF-8 艅덈퀩援좄짆蹂l맦孃る뜄痢싨찄蹂앷턄獒뺣끽依 111010001000100110000101111010111000110110001000111011011000000010101001111001101000111110110100111011001010001010000100111011001010011110000110111010001011100110000010111011111011110110001100111010111010011110100110111001011010110110000011111000111000001010001011111010111001110010000100111011111010011110100101111011001000101110101000111011001011000010000100111010001011100110000010111011001001010110110111111011011000010010000100111001111000110110010010111010111011101010100011111010111000000110111101111001001011111010011101 e88985eb8d88ed80a9e68fb4eca284eca786e8b982efbd8ceba7a6e5ad83e3828beb9c84efa7a5ec8ba8ecb084e8b982ec95b7ed8484e78d92ebbaa3eb81bde4be9d
UHC 艅덈퀩援좄짆蹂l맦孃る뜄痢싨찄蹂앷턄獒뺣끽依 1110011010101001100010001110101110110011100111011110101010110101101000001110100010100011100101011110101110110011101000111110110010010000101011111110010110111110101010101110101110001101100010001110110010111000100110101110011010101001100010001110101110110011100111011110101010110101101000001110100010100011100101011110101110110011101000111110101111101110 e6a988ebb39deab5a0e8a395ebb3a3ec90afe5beaaeb8d88ecb89ae6a988ebb39deab5a0e8a395ebb3a3ebee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)