To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 硝骼ハ丞齊ク踐治酌硝骼ハ丞齊ク踐治灼^ 100011111100100111101001100011101100101010001111111001011110101010001110101110001110011011110110100011101010000110001110110111101000111111001001111010011000111011001010100011111110010111101010100011101011100011100110111101101000111010100001100011101101110001011110 8fc9e98eca8fe5ea8eb8e6f68ea18ede8fc9e98eca8fe5ea8eb8e6f68ea18edc5e
EUC-JP 硝骼ハ丞齊ク踐治酌硝骼ハ丞齊ク踐治灼^ 10111110110010111111000111101110100011101100101010111110111001111111001111101110100011101011100011101100111110001011110010100011101111001110000010111110110010111111000111101110100011101100101010111110111001111111001111101110100011101011100011101100111110001011110010100011101111001101111001011110 becbf1ee8ecabee7f3ee8eb8ecf8bca3bce0becbf1ee8ecabee7f3ee8eb8ecf8bca3bcde5e
UTF-8 硝骼ハ丞齊ク踐治酌硝骼ハ丞齊ク踐治灼^ 11100111101000011001110111101001101010101011110011101111101111101000101011100100101110001001111011101001101111011000101011101111101111011011100011101000101110001001000011100110101100101011101111101001100001011000110011100111101000011001110111101001101010101011110011101111101111101000101011100100101110001001111011101001101111011000101011101111101111011011100011101000101110001001000011100110101100101011101111100111100000011011110001011110 e7a19de9aabcefbe8ae4b89ee9bd8aefbdb8e8b890e6b2bbe9858ce7a19de9aabcefbe8ae4b89ee9bd8aefbdb8e8b890e6b2bbe781bc5e
UHC 硝??丞齊?踐治酌硝??丞齊?踐治灼^ 11110101101001100011111100111111111000111010101011110000101110100011111111110100110000101111011010111101111011011100110011110101101001100011111100111111111000111010101011110000101110100011111111110100110000101111011010111101111011011100011101011110 f5a63f3fe3aaf0ba3ff4c2f6bdedccf5a63f3fe3aaf0ba3ff4c2f6bdedc75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)