To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 魚??揖??韋??艶l????娃??裕 1000101110011011001111110011111110010111010010110011111100111111111010001110100000111111001111111000100110010000100000101000110000111111001111110011111100111111100010001010000100111111001111111001011101010100 8b9b3f3f974b3f3fe8e83f3f8990828c3f3f3f3f88a13f3f9754
EUC-JP 魚??揖??韋??艶l?洹??娃??裕 10110101111110110011111100111111110011011010110000111111001111111111000011101010001111110011111110110001111100001010001111101100001111111000111111000111101110100011111100111111101100001010001100111111001111111100110110110101 b5fb3f3fcdac3f3ff0ea3f3fb1f0a3ec3f8fc7ba3f3fb0a33f3fcdb5
UTF-8 魚잙쉴揖먪땟韋얜짎艶l뫆洹욎넂娃뺢낮裕 111010011010110110011010111011001001111010011001111011001000100110110100111001101000111110010110111010111010100010101010111010111001010110011111111010011001111110001011111011001001011010011100111011001010011110001110111010001000100110110110111011111011110110001100111010111010101110000110111001101011010010111001111011001001101010001110111010111000010010000010111001011010100010000011111010111011101010100010111010111000001010101110111010001010001110010101 e9ad9aec9e99ec89b4e68f96eba8aaeb959fe99f8bec969ceca78ee889b6efbd8cebab86e6b4b9ec9a8eeb8482e5a883ebbaa2eb82aee8a395
UHC 魚잙쉴揖먪땟韋얜짎艶l뫆洹욎넂娃뺢낮裕 1110010111100000100111111110101110111101101011111110101111100111100100001110011110110110101011011110101011011111101111101110101110100011100110101110011011111101101000111110110010010001101010011110101010110111100111101110110010000110100100101110100011011111100101011110101010110011101101111110101110101110 e5e09febbdafebe790e7b6adeadfbeeba39ae6fda3ec91a9eab79eec8692e8df95eab3b7ebae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)