To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 螂ェ迢ク諱ッ隹キ驕應ソ玲」夂矯莉冶┳譌丞ュォ 111001011010010110101010111001111000101110111000111001101000000110101111111010001011000010110111111010011000000110011100111001001011111110010111111001101010001110011010111001111000101110111000111001001011101110010110111010001000010010110001111001101001011110001111111001011010110110101011 e5a5aae78bb8e681afe8b0b7e9819ce4bf97e6a39ae78bb8e4bb96e884b1e6978fe5adab
EUC-JP 螂ェ迢ク諱ッ隹キ驕應ソ玲」夂矯莉冶┳譌丞ュォ 1110101010100111100011101010101011101101111010111000111010111000111010111110000110001110101011111111000010110010100011101011011111110001111000011101100011100110100011101011111111001110111010001000111010100011110101001110100110110110101110101110100010111101110011001110101010101000101100111110101111110111101111101110011110001110101011011000111010101011 eaa78eaaedeb8eb8ebe18eaff0b28eb7f1e1d8e68ebfcee88ea3d4e9b6bae8bdcceaa8b3ebf7bee78ead8eab
UTF-8 螂ェ迢ク諱ッ隹キ驕應ソ玲」夂矯莉冶┳譌丞ュォ 111010001001111010000010111011111011110110101010111010001011111110100010111011111011110110111000111010001010101110110001111011111011110110101111111010011001101010111001111011111011110110110111111010011010100110010101111001101000011110001001111011111011110110111111111001111000111010110010111011111011110110100011111001011010010010000010111001111001111110101111111010001000111010001001111001011000011010110110111000101001010010110011111010001010110110001100111001001011100010011110111011111011110110101101111011111011110110101011 e89e82efbdaae8bfa2efbdb8e8abb1efbdafe99ab9efbdb7e9a995e68789efbdbfe78eb2efbda3e5a482e79fafe88e89e586b6e294b3e8ad8ce4b89eefbdadefbdab
UHC 螂???諱???驕應?玲??矯莉冶┳?丞?? 1101010111001100001111110011111100111111111111011100100100111111001111110011111111001110111101101110101111101011001111111101011010111100001111110011111111001110111011001101011111101001111001011010011110100110101100110011111111100011101010100011111100111111 d5cc3f3f3ffdc93f3f3fcef6ebeb3fd6bc3f3fceecd7e9e5a7a6b33fe3aa3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)