To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???乙ユ?營???→?擬??臆??轅??^ 0011111100111111001111111000100110110011100000111000011000111111100110100111101000111111001111110011111110000001101010000011111110001011010110110011111100111111100010011011000000111111001111111110011101110110001111110011111101011110 3f3f3f89b383863f9a7a3f3f3f81a83f8b5b3f3f89b03f3fe7763f3f5e
EUC-JP ???乙ユ?營??堉→?擬??臆??轅??^ 00111111001111110011111110110010101101011010010111100110001111111101001111011011001111110011111110001111101101111111110110100010101010100011111110110101101111000011111100111111101100101011001000111111001111111110110111010111001111110011111101011110 3f3f3fb2b5a5e63fd3db3f3f8fb7fda2aa3fb5bc3f3fb2b23f3fedd73f3f5e
UTF-8 黎싳뼏乙ユ늿營뚭였堉→콢擬쀬궖臆덄퐣轅곗죸^ 11101111101001101000100111101100100010111011001111101011101111001000111111100100101110011001100111100011100000111010011011101011100010101011111111100111100001111001111111101011100110101010110111101100100110001000000011100101101000001000100111100010100001101001001011101100101111011010001011100110100100111010110011101100100000001010110011101010101101101001011011101000100001111000011011101011100011011000010011101101100100001010001111101000101111011000010111101010101100111001011111101100101000111011100001011110 efa689ec8bb3ebbc8fe4b999e383a6eb8abfe7879feb9aadec9880e5a089e28692ecbda2e693acec80aceab696e88786eb8d84ed90a3e8bd85eab397eca3b85e
UHC 黎싳뼏乙ユ늿營뚭였堉→콢擬쀬궖臆덄퐣轅곗죸^ 11100110101100011001101011101100100101101001011111101011111000001010101111100110100010001000100011100111101111011000110011101010101111111011010011101011101111001010000111100110101100011001101011101011111101001001011111101100100000101010101111100101111001101000100011100111101111011000110011101010101111111011000011101100101000011001001001011110 e6b19aec9697ebe0abe68888e7bd8ceabfb4ebbca1e6b19aebf497ec82abe5e688e7bd8ceabfb0eca1925e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)