To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??攸??瑤???ι?蟻??億??? 00111111001111110011111111101000111010000011111100111111100111011011111100111111001111111110101010100010001111110011111100111111100000111100011100111111100010110110000100111111001111111000100110101101001111110011111100111111 3f3f3fe8e83f3f9dbf3f3feaa23f3f3f83c73f8b613f3f89ad3f3f3f
EUC-JP ???韋??攸??瑤??佾ι?蟻??億??? 001111110011111100111111111100001110101000111111001111111101101011000001001111110011111111110100101001000011111100111111100011111011000011111011101001101100100100111111101101011100001000111111001111111011001010101111001111110011111100111111 3f3f3ff0ea3f3fdac13f3ff4a43f3f8fb0fba6c93fb5c23f3fb2af3f3f3f
UTF-8 玲곷씭韋껆뙴攸껊젧瑤녹슦佾ι쉬蟻숇쾴億됰뀘溜 1110111110100110101011011110101010110011101101111110110010010100101011011110100110011111100010111110101010111011100001101110101110011001101101001110011010010100101110001110101010111011100010101110110010100000101001111110011110010001101001001110101110000101101110011110110010001010101001101110010010111101101111101100111010111001111011001000100110101100111010001001111110111011111011001000100010000111111011001011111010110100111001011000010010000100111010111001000010110000111010111000000010011000111011111010011110001011 efa6adeab3b7ec94ade99f8beabb86eb99b4e694b8eabb8aeca0a7e791a4eb85b9ec8aa6e4bdbeceb9ec89ace89fbbec8887ecbeb4e58484eb90b0eb8098efa78b
UHC 玲곷씭韋껆뙴攸껊젧瑤녹슦佾ι쉬蟻숇쾴億됰뀘溜 1110011110111111100000011110101110011101101111101110101011011111100000111110011110001100101101111110101011110010100000111110101110100000100111111110100011111101101100111110110010011010101100001110110011101011101001011110100110111101101011001110101111111100100110011110101110110010100010101110010111100010100010011110101110000101100100011110101011111110 e7bf81eb9dbeeadf83e78cb7eaf283eba09fe8fdb3ec9ab0eceba5e9bdacebfc99ebb28ae5e289eb8591eafe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)