To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??裕?┿筍щ?罌??鎰??攸??渦 11100001100111110011111100111111100101110101010000111111100001001011100111100010101000011000010010001011001111111110001110100000001111110011111111101000010011000011111100111111100111011011111100111111001111111000100101010001 e19f3f3f97543f84b9e2a1848b3fe3a03f3fe84c3f3f9dbf3f3f8951
EUC-JP 癲??裕?┿筍щ?罌??鎰??攸??渦 11100010101000010011111100111111110011011011010100111111101010001011101111100100101000111010011111101011001111111110011010100010001111110011111111101111101011010011111100111111110110101100000100111111001111111011000110110010 e2a13f3fcdb53fa8bbe4a3a7eb3fe6a23f3fefad3f3fdac13f3fb1b2
UTF-8 癲븍쵉裕낉┿筍щ츉罌삼퐦鎰꾣뀆攸됱돖渦 1110011110011001101100101110101110111000100011011110110010110101100010011110100010100011100101011110101110000010100010011110001010010100101111111110011110101101100011011101000110001001111011001011100010001001111001111011110110001100111011001000001010111100111011011001000010100110111010011000111010110000111010101011111010100011111010111000000010000110111001101001010010111000111010111001000010110001111010111000111110010110111001101011100010100110 e799b2ebb88decb589e8a395eb8289e294bfe7ad8dd189ecb889e7bd8cec82bced90a6e98eb0eabea3eb8086e694b8eb90b1eb8f96e6b8a6
UHC 癲븍쵉裕낉┿筍щ츉罌삼퐦鎰꾣뀆攸됱돖渦 1110111110100110101110101110101110101100100010111110101110101110100001011110111110100110101110111110001011101100101011001110101110101110100001011110010110100010101110111110111110111101100011111110110011110000100001001110011010000101100000101110101011110010100010011110110010001001101000001110100010111110 efa6baebac8bebae85efa6bbe2ecacebae85e5a2bbefbd8fecf084e68582eaf289ec89a0e8be

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
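
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that the Python codecs `cp932` and `cp949` correspond to SJIS-Win and UHC, and that characters a charset cannot represent are replaced with `?` (byte 3f), as the ISO-8859-1 row above suggests. The names `CHARSETS` and `encode_to_bit_strings` are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is the closest Python codec
}


def encode_to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    # Eight binary digits per byte, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

With this sketch, a character that a charset cannot represent comes out as `00111111` / `3f`, which is the pattern that repeats throughout the ISO-8859-1 row.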