To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 剪?芝?闇???耳?壁剪?芝?闇???耳?批^ 100110011001001000111111100011101100010100111111100010001100010100111111001111110011111110001110101010000011111110010101110001111001100110010010001111111000111011000101001111111000100011000101001111110011111100111111100011101010100000111111100101001110000101011110 99923f8ec53f88c53f3f3f8ea83f95c799923f8ec53f88c53f3f3f8ea83f94e15e
EUC-JP 剪?芝?闇???耳?壁剪?芝?闇???耳?批^ 110100011111001000111111101111001100011100111111101100001100011100111111001111110011111110111100101010100011111111001010110010011101000111110010001111111011110011000111001111111011000011000111001111110011111100111111101111001010101000111111110010001110001101011110 d1f23fbcc73fb0c73f3f3fbcaa3fcac9d1f23fbcc73fb0c73f3f3fbcaa3fc8e35e
UTF-8 剪렮芝렫闇쮸렠렗耳얕壁剪렮芝렫闇쮸렠렗耳양批^ 11100101100010011010101011101011101000001010111011101000100010101001110111101011101000001010101111101001100101111000011111101100101011101011100011101011101000001010000011101011101000001001011111101000100000001011001111101100100101101001010111100101101000111000000111100101100010011010101011101011101000001010111011101000100010101001110111101011101000001010101111101001100101111000011111101100101011101011100011101011101000001010000011101011101000001001011111101000100000001011001111101100100101101001000111100110100010011011100101011110 e589aaeba0aee88a9deba0abe99787ecaeb8eba0a0eba097e880b3ec9695e5a381e589aaeba0aee88a9deba0abe99787ecaeb8eba0a0eba097e880b3ec9691e689b95e
UHC 剪렮芝렫闇쮸렠렗耳얕壁剪렮芝렫闇쮸렠렗耳양批^ 111011101111001010001110101110111111001010111001100011101011100111100100111000011100001011101001100011101011000110001110101011001110110010111100101111101110100011011011111110101110111011110010100011101011101111110010101110011000111010111001111001001110000111000010111010011000111010110001100011101010110011101100101111001011111011100111110111011110101101011110 eef28ebbf2b98eb9e4e1c2e98eb18eacecbcbee8dbfaeef28ebbf2b98eb9e4e1c2e98eb18eacecbcbee7ddeb5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)