To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???意③?擬???ラ?畏??撓?????^ 00111111001111110011111110001000110100111000011101000010001111111000101101011011001111110011111100111111100000111000100100111111100010001101100000111111001111111001110110011010001111110011111100111111001111110011111101011110 3f3f3f88d387423f8b5b3f3f3f83893f88d83f3f9d9a3f3f3f3f3f5e
EUC-JP ???意??擬???ラ?畏??撓?????^ 001111110011111100111111101100001101010100111111001111111011010110111100001111110011111100111111101001011110100100111111101100001101101000111111001111111101100111111010001111110011111100111111001111110011111101011110 3f3f3fb0d53f3fb5bc3f3f3fa5e93fb0da3f3fd9fa3f3f3f3f3f5e
UTF-8 樂꾦닄意③뾿擬⑸젇列ラ썑畏띿뵪撓뉗뼦溜딀씟^ 11101111101001101011111111101010101111101010011011101011100010111000010011100110100001001000111111100010100100011010001011101011101111101011111111100110100100111010110011100010100100011011100011101100101000001000011111101111101001101001110011100011100000111010100111101100100011011001000111100111100101011000111111101011100111011011111111101011101101011010101011100110100100101001001111101011100010011001011111101011101111001010011011101111101001111000101111101011100101001000000011101100100101001001111101011110 efa6bfeabea6eb8b84e6848fe291a2ebbebfe693ace291b8eca087efa69ce383a9ec8d91e7958feb9dbfebb5aae69293eb8997ebbca6efa78beb9480ec949f5e
UHC 樂꾦닄意③뾿擬⑸젇列ラ썑畏띿뵪撓뉗뼦溜딀씟^ 11101000111110011000010011101001100010001000110111101011111100101010100011101001100101111000011111101011111101001010100111101011101000001000101011100110111010101010101111101001100110111000010011101000111001101000110111101100100101001010100011101000111101011000011111101100100101101010100111101010111111101000101011100110100111011011001101011110 e8f984e9888debf2a8e99787ebf4a9eba08ae6eaabe99b84e8e68dec94a8e8f587ec96a9eafe8ae69db35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)