What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 螂幄ュッ鬲假スォ鬲倅コ・辣「遏ョ雎ク陜 1110010110100101100110111110100010101101101011111110100110101101100110001110111110111101101010111110100110101101100110001110010010111010101001011110011110000101101000101110011110011111101011101110100010110001101110001110100010011101 e5a59be8adafe9ad98efbdabe9ad98e4baa5e785a2e79faee8b1b8e89d
EUC-JP 螂幄ュッ鬲假スォ鬲倅コ・辣「遏ョ雎ク陜 1110101010100111110101101110101010001110101011011000111010101111111100101010111111010000111100011000111010111101100011101010101111110010101011111101000011100110100011101011101010001110101001011110110111100101100011101010001011101110101000011000111010101110111100001011001110001110101110001110111111111101 eaa7d6ea8ead8eaff2afd0f18ebd8eabf2afd0e68eba8ea5ede58ea2eea18eaef0b38eb8effd
UTF-8 螂幄ュッ鬲假スォ鬲倅コ・辣「遏ョ雎ク陜 111010001001111010000010111001011011100110000100111011111011110110101101111011111011110110101111111010011010110010110010111001011000000110000111111011111011110110111101111011111011110110101011111010011010110010110010111001011000000010000101111011111011110110111010111011111011110110100101111010001011111010100011111011111011110110100010111010011000000110001111111011111011110110101110111010011001101110001110111011111011110110111000111010011001100110011100 e89e82e5b984efbdadefbdafe9acb2e58187efbdbdefbdabe9acb2e58085efbdbaefbda5e8bea3efbda2e9818fefbdaee99b8eefbdb8e9999c
UHC 螂幄???假??????辣???雎?陜 11010101110011001110010011000001001111110011111100111111110010101010001100111111001111110011111100111111001111110011111111010101101110000011111100111111001111111110111011010001001111111111100111110000 d5cce4c13f3f3fcaa33f3f3f3f3f3fd5b83f3f3feed13ff9f0

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
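
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's own implementation (which is not shown here). It assumes Python's codec names `cp932` for SJIS-Win and `cp949` for UHC, and it assumes that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above. The function name `encode_to_bits` and the `CHARSETS` mapping are illustrative names, not part of the original tool.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as a hexadecimal string.

# Display name -> Python codec name (the codec choices are assumptions).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary_string, hex_string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "あ"  # HIRAGANA LETTER A, used only as an example input
    for name, codec in CHARSETS.items():
        bits, hexstr = encode_to_bits(sample, codec)
        print(name, bits, hexstr)
```

Using `errors="replace"` keeps the conversion from raising an exception on unrepresentable characters, at the cost of losing the original character in the output.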