Which bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 礁ス捨鏸湿上瘤ン写礁ス捨鏸湿上瘤ン写B 100011111100101010111101100011101100110011111011111001001000111010111100100011111110001111100001100011101101110110001110110010101000111111001010101111011000111011001100111110111110010010001110101111001000111111100011111000011000111011011101100011101100101001000010 8fcabd8eccfbe48ebc8fe3e18edd8eca8fcabd8eccfbe48ebc8fe3e18edd8eca42
EUC-JP 礁ス捨鏸湿上瘤ン写礁ス捨鏸湿上瘤ン写B 101111101100110010001110101111011011110011001110100011111110010111010000101111001011111010111110111001011110000111101110100011101101110110111100110011001011111011001100100011101011110110111100110011101000111111100101110100001011110010111110101111101110010111100001111011101000111011011101101111001100110001000010 becc8ebdbcce8fe5d0bcbebee5e1ee8eddbcccbecc8ebdbcce8fe5d0bcbebee5e1ee8eddbccc42
UTF-8 礁ス捨鏸湿上瘤ン写礁ス捨鏸湿上瘤ン写B 11100111101001001000000111101111101111011011110111100110100011011010100011101001100011111011100011100110101110011011111111100100101110001000101011100111100110001010010011101111101111101001110111100101100001101001100111100111101001001000000111101111101111011011110111100110100011011010100011101001100011111011100011100110101110011011111111100100101110001000101011100111100110001010010011101111101111101001110111100101100001101001100101000010 e7a481efbdbde68da8e98fb8e6b9bfe4b88ae798a4efbe9de58699e7a481efbdbde68da8e98fb8e6b9bfe4b88ae798a4efbe9de5869942
UHC 礁?捨??上瘤??礁?捨??上瘤??B 111101011010011100111111110111101101011100111111001111111101111110111110110101111011101100111111001111111111010110100111001111111101111011010111001111110011111111011111101111101101011110111011001111110011111101000010 f5a73fded73f3fdfbed7bb3f3ff5a73fded73f3fdfbed7bb3f3f42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
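
The conversion shown in the table can be approximated in Python. This is a minimal sketch, not the tool's actual implementation: the names `encode_row` and `encode_table` are hypothetical, the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the 3f bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    sample = "礁B"  # first and last characters of the table's sample string
    for label, (bits, hex_string) in encode_table(sample).items():
        print(label, sample, bits, hex_string)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.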