To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?垣兢?棒??趙貊?肯????制耕??調 11100011011100010011111110001010010111111001100101011101001111111001011001011111001111110011111111100110111000101110011010111011001111111000110101101101001111110011111100111111001111111001000010100111100011010110101100111111001111111001001010110010 e3713f8a5f995d3f965f3f3fe6e2e6bb3f8d6d3f3f3f3f90a78d6b3f3f92b2
EUC-JP 縡?垣兢?棒??趙貊?肯??勖?制耕??調 111001011101001000111111101100111100000011010001101111100011111111001011110000000011111100111111111011001110010011101100101111010011111110111001110011100011111100111111100011111011001111101101001111111100000010101001101110011100110000111111001111111100010010110100 e5d23fb3c0d1be3fcbc03f3fece4ecbd3fb9ce3f3f8fb3ed3fc0a9b9cc3f3fc4b4
UTF-8 縡렕垣兢렠棒렕렟趙貊렚肯렖렕勖렢制耕렖렕調 111001111011100010100001111010111010000010010101111001011001111010100011111001011000010110100010111010111010000010100000111001101010001110010010111010111010000010010101111010111010000010011111111010001011011010011001111010001011001010001010111010111010000010011010111010001000001010101111111010111010000010010110111010111010000010010101111001011000101110010110111010111010000010100010111001011000100010110110111010001000000010010101111010111010000010010110111010111010000010010101111010001010101010111111 e7b8a1eba095e59ea3e585a2eba0a0e6a392eba095eba09fe8b699e8b28aeba09ae882afeba096eba095e58b96eba0a2e588b6e88095eba096eba095e8aabf
UHC 縡렕垣兢렠棒렕렟趙貊렚肯렖렕勖렢制耕렖렕調 111011101010110110001110101010101110101010101111110100001110011110001110101100011101110011101010100011101010101010001110101100001111000011100001110110001110011110001110101011011101000011101001100011101010101110001110101010101110100111101101100011101011001111110000101001001100110011101001100011101010101110001110101010101111000011100000 eead8eaaeaafd0e78eb1dcea8eaa8eb0f0e1d8e78eadd0e98eab8eaae9ed8eb3f0a4cce98eab8eaaf0e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)