To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 上ケ上瘤メ竺漆酌上ケ上瘤メ竺漆灼^ 10001111111000111111001010111100101110011000111111100011111000011000111011010010111100011110111010001110101100011000111010111101100011101101111010001111111000111111001010111100101110011000111111100011111000011000111011010010111100011110111010001110101100011000111010111101100011101101110001011110 8fe3f2bcb98fe3e18ed2f1ee8eb18ebd8ede8fe3f2bcb98fe3e18ed2f1ee8eb18ebd8edc5e
EUC-JP 上?ケ上瘤メ?竺漆酌上?ケ上瘤メ?竺漆灼^ 10111110111001010011111110001110101110011011111011100101111000011110111010001110110100100011111110111100101100111011110010111111101111001110000010111110111001010011111110001110101110011011111011100101111000011110111010001110110100100011111110111100101100111011110010111111101111001101111001011110 bee53f8eb9bee5e1ee8ed23fbcb3bcbfbce0bee53f8eb9bee5e1ee8ed23fbcb3bcbfbcde5e
UTF-8 上ケ上瘤メ竺漆酌上ケ上瘤メ竺漆灼^ 11100100101110001000101011101110100001111011001111101111101111011011100111100100101110001000101011100111100110001010010011101111101111101001001011101110100001011010100111100111101010111011101011100110101111001000011011101001100001011000110011100100101110001000101011101110100001111011001111101111101111011011100111100100101110001000101011100111100110001010010011101111101111101001001011101110100001011010100111100111101010111011101011100110101111001000011011100111100000011011110001011110 e4b88aee87b3efbdb9e4b88ae798a4efbe92ee85a9e7abbae6bc86e9858ce4b88aee87b3efbdb9e4b88ae798a4efbe92ee85a9e7abbae6bc86e781bc5e
UHC 上??上瘤??竺漆酌上??上瘤??竺漆灼^ 110111111011111000111111001111111101111110111110110101111011101100111111001111111111010111100111111101101101010011101101110011001101111110111110001111110011111111011111101111101101011110111011001111110011111111110101111001111111011011010100111011011100011101011110 dfbe3f3fdfbed7bb3f3ff5e7f6d4edccdfbe3f3fdfbed7bb3f3ff5e7f6d4edc75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)