To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 礁ス捨﨩治硝ロ杓礁ス捨﨩治硝ロ杓B 100011111100101010111101100011101100110011111011111010101000111010100001100011111100100111110100100011101101101110001110110110111000111111001010101111011000111011001100111110111110101010001110101000011000111111001001111101001000111011011011100011101101101101000010 8fcabd8eccfbea8ea18fc9f48edb8edb8fcabd8eccfbea8ea18fc9f48edb8edb42
EUC-JP 礁ス捨?治硝?ロ杓礁ス捨?治硝?ロ杓B 101111101100110010001110101111011011110011001110001111111011110010100011101111101100101100111111100011101101101110111100110111011011111011001100100011101011110110111100110011100011111110111100101000111011111011001011001111111000111011011011101111001101110101000010 becc8ebdbcce3fbca3becb3f8edbbcddbecc8ebdbcce3fbca3becb3f8edbbcdd42
UTF-8 礁ス捨﨩治硝ロ杓礁ス捨﨩治硝ロ杓B 11100111101001001000000111101111101111011011110111100110100011011010100011101111101010001010100111100110101100101011101111100111101000011001110111101110100011001011110111101111101111101001101111100110100111011001001111100111101001001000000111101111101111011011110111100110100011011010100011101111101010001010100111100110101100101011101111100111101000011001110111101110100011001011110111101111101111101001101111100110100111011001001101000010 e7a481efbdbde68da8efa8a9e6b2bbe7a19dee8cbdefbe9be69d93e7a481efbdbde68da8efa8a9e6b2bbe7a19dee8cbdefbe9be69d9342
UHC 礁?捨?治硝??杓礁?捨?治硝??杓B 1111010110100111001111111101111011010111001111111111011010111101111101011010011000111111001111111111100011110101111101011010011100111111110111101101011100111111111101101011110111110101101001100011111100111111111110001111010101000010 f5a73fded73ff6bdf5a63f3ff8f5f5a73fded73ff6bdf5a63f3ff8f542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)