What bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 夜??厓э?厓ц?夜??厓э?厓ц?^ 1001011011101001001111110011111111111010100011011000010010001111001111111111101010001101100001001000100000111111100101101110100100111111001111111111101010001101100001001000111100111111111110101000110110000100100010000011111101011110 96e93f3ffa8d848f3ffa8d84883f96e93f3ffa8d848f3ffa8d84883f5e
EUC-JP 夜??厓э?厓ц?夜??厓э?厓ц?^ 110011001110101100111111001111111000111110110100110001111010011111101111001111111000111110110100110001111010011111101000001111111100110011101011001111110011111110001111101101001100011110100111111011110011111110001111101101001100011110100111111010000011111101011110 cceb3f3f8fb4c7a7ef3f8fb4c7a7e83fcceb3f3f8fb4c7a7ef3f8fb4c7a7e83f5e
UTF-8 夜쇽푳厓э푶厓ц븪夜쇽푳厓э푶厓ц몚^ 111001011010010010011100111011001000011110111101111011011001000110110011111001011000111010010011110100011000110111101101100100011011011011100101100011101001001111010001100001101110101110111000101010101110010110100100100111001110110010000111101111011110110110010001101100111110010110001110100100111101000110001101111011011001000110110110111001011000111010010011110100011000011011101011101010101001101001011110 e5a49cec87bded91b3e58e93d18ded91b6e58e93d186ebb8aae5a49cec87bded91b3e58e93d18ded91b6e58e93d186ebaa9a5e
UHC 夜쇽푳厓э푶厓ц븪夜쇽푳厓э푶厓ц몚^ 11100101101010001011110011101111101111101000000111100100111011011010110011101111101111101000010011100100111011011010110011101000100101011001001111100101101010001011110011101111101111101000000111100100111011011010110011101111101111101000010011100100111011011010110011101000100100011000100001011110 e5a8bcefbe81e4edacefbe84e4edace89593e5a8bcefbe81e4edacefbe84e4edace891885e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
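
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond to the charset labels above, and the helper names `to_bit_strings` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well; a few edge-case mappings may differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: to_bit_strings(text, codec)
            for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("夜^").items():
        print(label, binary, hexadecimal)
```

For example, `convert("夜")["UTF-8"]` yields the hexadecimal string `e5a49c`, which matches the first three bytes of the UTF-8 row in the table.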