To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 縡?衣??棒??醍???縡?衣??棒??醍???^ 111000110111000100111111100010001101111100111111001111111001011001011111001111110011111110010001111001110011111100111111001111111110001101110001001111111000100011011111001111110011111110010110010111110011111100111111100100011110011100111111001111110011111101011110 e3713f88df3f3f965f3f3f91e73f3f3fe3713f88df3f3f965f3f3f91e73f3f3f5e
EUC-JP 縡?衣??棒??醍???縡?衣??棒??醍???^ 111001011101001000111111101100001110000100111111001111111100101111000000001111110011111111000010111010010011111100111111001111111110010111010010001111111011000011100001001111110011111111001011110000000011111100111111110000101110100100111111001111110011111101011110 e5d23fb0e13f3fcbc03f3fc2e93f3f3fe5d23fb0e13f3fcbc03f3fc2e93f3f3f5e
UTF-8 縡렕衣쭹렠棒렕렟醍닺렚쁩縡렕衣쭹렠棒렕렟醍닺렚쁠^ 11100111101110001010000111101011101000001001010111101000101000011010001111101100101011011011100111101011101000001010000011100110101000111001001011101011101000001001010111101011101000001001111111101001100001101000110111101011100010111011101011101011101000001001101011101100100000011010100111100111101110001010000111101011101000001001010111101000101000011010001111101100101011011011100111101011101000001010000011100110101000111001001011101011101000001001010111101011101000001001111111101001100001101000110111101011100010111011101011101011101000001001101011101100100000011010000001011110 e7b8a1eba095e8a1a3ecadb9eba0a0e6a392eba095eba09fe9868deb8bbaeba09aec81a9e7b8a1eba095e8a1a3ecadb9eba0a0e6a392eba095eba09fe9868deb8bbaeba09aec81a05e
UHC 縡렕衣쭹렠棒렕렟醍닺렚쁩縡렕衣쭹렠棒렕렟醍닺렚쁠^ 11101110101011011000111010101010111010111111110111000010111001111000111010110001110111001110101010001110101010101000111010110000111100001011010110110100111010001000111010101101101110111101111011101110101011011000111010101010111010111111110111000010111001111000111010110001110111001110101010001110101010101000111010110000111100001011010110110100111010001000111010101101101110111101110001011110 eead8eaaebfdc2e78eb1dcea8eaa8eb0f0b5b4e88eadbbdeeead8eaaebfdc2e78eb1dcea8eaa8eb0f0b5b4e88eadbbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)