What bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???韋?????蹂????????袁⑤?B 0011111100111111001111111110100011101000001111110011111100111111001111110011111111100110111110000011111100111111001111110011111100111111001111110011111100111111111001011100110110000111010001000011111101000010 3f3f3fe8e83f3f3f3f3fe6f83f3f3f3f3f3f3f3fe5cd87443f42
EUC-JP ???韋??獒??蹂??洧?????袁??B 0011111100111111001111111111000011101010001111110011111110001111110010111011101100111111001111111110110011111010001111110011111110001111110001111011010000111111001111110011111100111111001111111110101011001111001111110011111101000010 3f3f3ff0ea3f3f8fcbbb3f3fecfa3f3f8fc7b43f3f3f3f3feacf3f3f42
UTF-8 玲곷씭韋껎슎獒붽쑬蹂쇘퓖洧얜걙淋욆짅袁⑤낵B 11101111101001101010110111101010101100111011011111101100100101001010110111101001100111111000101111101010101110111000111011101100100010101000111011100111100011011001001011101011101101101011110111101100100100011010110011101000101110011000001011101100100001111001100011101101100100111001011011100110101101001010011111101100100101101001110011101010101100011001100111101111101001111011010111101100100110101000011011101100101001111000010111101000101000101000000111100010100100011010010011101011100000101011010101000010 efa6adeab3b7ec94ade99f8beabb8eec8a8ee78d92ebb6bdec91ace8b982ec8798ed9396e6b4a7ec969ceab199efa7b5ec9a86eca785e8a281e291a4eb82b542
UHC 玲곷씭韋껎슎獒붽쑬蹂쇘퓖洧얜걙淋욆짅袁⑤낵B 11100111101111111000000111101011100111011011111011101010110111111000001111101101100110101001111011101000101000111001010011101010101111101010100011101011101100111011110011100111101111111000000111101010111110111011111011101011100000011000001111101100111110001001111011101000101000111001010011101010101111101010100011101011101100111011110001000010 e7bf81eb9dbeeadf83ed9a9ee8a394eabea8ebb3bce7bf81eafbbeeb8183ecf89ee8a394eabea8ebb3bc42

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
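
A conversion like the one in the table can be sketched in Python. This is only an illustration, not the code behind this page: the names `CHARSETS` and `encode_to_bits` are made up for the example, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the rows above show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unmappable characters are replaced with "?" instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "B"  # try any character or short string here
    for label, codec in CHARSETS.items():
        binary, hexa = encode_to_bits(sample, codec)
        print(label, sample, binary, hexa)
```

For the letter "B", every charset listed here gives the single byte 42 (binary 01000010), which matches the last byte of each row in the table.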